A NEW CLASS OF ENZYMES IN THE BIOSYNTHETIC PATHWAY FOR THE
PRODUCTION OF TRIACYLGLYCEROL AND RECOMBINANT DNA
MOLECULES ENCODING THESE ENZYMES

The present invention relates to the isolation, identification and characterization
of recombinant DNA molecules encoding enzymes catalysing the transfer of
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway for
the production of triacylglycerol.

Triacylglycerol (TAG) is the most common lipid-based energy reserve in nature.
The main pathway for synthesis of TAG is believed to involve three sequential
acyl-transfers from acyl-CoA to a glycerol backbone (1, 2). For many years,
acyl-CoA : diacylglycerol acyltransferase (DAGAT), which catalyzes the third
acyl transfer reaction, was thought to be the only unique enzyme involved in
TAG synthesis. It acts by diverting diacylglycerol (DAG) from membrane lipid
synthesis into TAG (2). Genes encoding this enzyme were recently identified
both in the mouse (3) and in plants (4, 5), and the encoded proteins were
shown to be homologous to acyl-CoA : cholesterol acyltransferase (ACAT). It
was also recently reported that another DAGAT exists in the oleaginous fungus
Mortierella ramanniana, which is unrelated to the mouse DAGAT, the ACAT
gene family or to any other known gene (6).

The instant invention relates to a novel type of enzymes and their encoding
genes for transformation. More specifically, the invention relates to the use of a
type of genes encoding a not previously described type of enzymes, hereinafter
designated phospholipid:diacylglycerol acyltransferases (PDAT), whereby this
enzyme catalyses an acyl-CoA-independent reaction. The said type of genes,
expressed alone in transgenic organisms, will enhance the total amount of oil
(triacylglycerols) produced in the cells. The PDAT genes, in combination with a
gene for the synthesis of an uncommon fatty acid, will, when expressed in
transgenic organisms, enhance the levels of the uncommon fatty acids in the
triacylglycerols.

There is considerable interest world-wide in producing chemical feedstock,
such as fatty acids, for industrial use from renewable plant resources rather
than non-renewable petrochemicals. This concept has broad appeal to
manufacturers and consumers on the basis of resource conservation and
provides significant opportunity to develop new industrial crops for agriculture.

There is a diverse array of unusual fatty acids in oils from wild plant species 
and these have been well characterised. Many of these acids have industrial 
potential and this has led to interest in domesticating relevant plant species to 
enable agricultural production of particular fatty acids. 

Developments in genetic engineering technologies combined with greater
understanding of the biosynthesis of unusual fatty acids now make it possible
to transfer genes coding for key enzymes involved in the synthesis of a 
particular fatty acid from a wild species into domesticated oilseed crops. In this 
way individual fatty acids can be produced in high purity and quantities at 
moderate costs. 

In all crops like rape, sunflower, oil palm etc., the oil (i.e. triacylglycerols) is the
most valuable product of the seeds or fruits, and other compounds like starch,
protein and fibre are regarded as by-products with less value. Enhancing the
quantity of oil per weight basis at the expense of other compounds in oil crops
would therefore increase the value of the crop. If genes regulating the allocation
of reduced carbon into the production of oil can be up-regulated, the cells will
accumulate more oil at the expense of other products. Such genes might not
only be used in already high oil producing cells, such as oil crops, but could
also induce significant oil production in moderate or low oil containing crops
such as soy, oat, maize, potato, sugar beets and turnips, as well as in
micro-organisms.

Summary of the invention

Many of the unusual fatty acids of interest, e.g. medium chain fatty acids,
hydroxy fatty acids, epoxy fatty acids and acetylenic fatty acids, have physical
properties that are distinctly different from the common plant fatty acids. The
present inventors have found that, in plant species naturally accumulating
these uncommon fatty acids in their seed oil (i.e. triacylglycerol), these acids
are absent, or present in very low amounts, in the membrane (phospho)lipids of
the seed. The low concentration of these acids in the membrane lipids is most
likely a prerequisite for proper membrane function and thereby for proper cell
functions. One aspect of the invention is that seeds of transgenic crops can be
made to accumulate high amounts of uncommon fatty acids if these fatty acids
are efficiently removed from the membrane lipids and channelled into seed
triacylglycerols.

The inventors have identified a novel class of enzymes in plants catalysing the
transfer of fatty acids from phospholipids to diacylglycerol in the production of
triacylglycerol through an acyl-CoA-independent reaction, and have found that
these enzymes (phospholipid:diacylglycerol acyltransferases, abbreviated as
PDAT) are involved in the removal of hydroxylated and epoxygenated fatty acids,
and probably also other uncommon fatty acids such as medium chain fatty acids,
from phospholipids in plants.

This enzyme reaction was shown to be present in microsomal preparations
from baker's yeast (Saccharomyces cerevisiae). The instant invention further
pertains to an enzyme comprising an amino acid sequence as set forth in SEQ
ID NO. 2 or a functional fragment, derivate, allele, homolog or isoenzyme
thereof. A so-called 'knock out' yeast mutant, disrupted in the respective gene,
was obtained, and microsomal membranes from the mutant were shown to
totally lack PDAT activity. Thus, it was proved that the disrupted gene encodes
a PDAT enzyme (SEQ ID NO. 1 and 2). Furthermore, this PDAT enzyme is
characterized through the amino acid sequence as set forth in SEQ ID NO. 2
containing a lipase motif of the conserved sequence string FXKWVEA.
The instant invention pertains further to an enzyme comprising an amino acid
sequence as set forth in SEQ ID NO. 1a, 2b or 5a or a functional fragment,
derivate, allele, homolog or isoenzyme thereof.
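
As an illustration of how a candidate protein sequence might be screened for the lipase motif FXKWVEA, the following is a minimal sketch. It assumes that X stands for any single amino acid residue in one-letter code; the function names and the example peptide are hypothetical and invented for illustration only, and the peptide is not taken from SEQ ID NO. 2.

```python
import re

# Motif from the text; "X" is assumed to mean any single amino acid residue.
MOTIF = "FXKWVEA"


def motif_to_regex(motif: str) -> str:
    """Translate a one-letter motif with 'X' wildcards into a regex string."""
    return "".join("[A-Z]" if c == "X" else re.escape(c) for c in motif.upper())


def find_motif(sequence: str, motif: str = MOTIF) -> list[int]:
    """Return the 1-based start position of every match, overlaps included."""
    # A lookahead is used so that overlapping occurrences are all reported.
    pattern = re.compile(f"(?=({motif_to_regex(motif)}))")
    return [m.start() + 1 for m in pattern.finditer(sequence.upper())]


# Hypothetical peptide, invented for illustration (not a real PDAT sequence).
example = "MSTLFGKWVEAPQRFAKWVEAK"
print(find_motif(example))
```

A sequence without the motif simply yields an empty list, so the same function can be mapped over a set of candidate open reading frames.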

Further genes and/or proteins of so far unknown function were identified and
are contemplated within the scope of the instant invention. A gene from
Schizosaccharomyces pombe, SPBC776.14 (SEQ ID NO. 3), and a putative open
reading frame CAA22887 of SPBC776.14 (SEQ ID NO. 13) were identified.
Further, Arabidopsis thaliana genomic sequences (SEQ ID NO. 4, 10 and 11)
coding for putative proteins were identified, as well as a putative open reading
frame AAC80628 from the A. thaliana locus AC004557 (SEQ ID NO. 14) and a
putative open reading frame AAD10668 from the A. thaliana locus AC003027
(SEQ ID NO. 15).

Also, a partially sequenced cDNA clone from Neurospora crassa (SEQ ID NO.
9) and a Zea mays EST (Expressed Sequence Tag) clone (SEQ ID NO. 7) and
corresponding putative amino acid sequence (SEQ ID NO. 8) were identified.
Finally, two cDNA clones were identified: one Arabidopsis thaliana EST (SEQ
ID NO. 5 and corresponding predicted amino acid sequence SEQ ID NO. 6)
and a Lycopersicon esculentum EST clone (SEQ ID NO. 12).
Further, enzymes designated as PDAT comprising an amino acid sequence
selected from the group consisting of sequences as set forth in SEQ ID NO. 2a,
3a, 5b, 6 or 7b containing a lipase motif FXKWVEA are contemplated within the
scope of the invention. Moreover, an enzyme comprising an amino acid
sequence encoded through a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of sequences as set forth
in SEQ ID NO. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 11, 11b or
12 or a functional fragment, derivate, allele, homolog or isoenzyme of the
enzyme encoding amino acid sequence is included within the scope of the
invention.

A functional fragment of the instant enzyme is understood to be any
polypeptide sequence which shows specific enzyme activity of a
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the functional
fragment can for example vary in a range from about 660 ± 10 amino acids to
660 ± 250 amino acids, preferably from about 660 ± 50 to 660 ± 100 amino
acids, whereby the "basic number" of 660 amino acids corresponds in this case
to the polypeptide chain of the PDAT enzyme of SEQ ID NO. 2 encoded by a
nucleotide sequence according to SEQ ID NO. 1. Consequently, the "basic
number" of the functional full-length enzyme can vary in correspondence to the
encoding nucleotide sequence.

A portion of the instant nucleotide sequence is meant to be any nucleotide
sequence encoding a polypeptide which shows specific activity of a
phospholipid:diacylglycerol acyltransferase (PDAT). The length of the
nucleotide portion can vary in a wide range of about several hundreds of
nucleotides based upon the coding region of the gene or a highly conserved
sequence. For example, the length varies in a range from about 1900 ± 10 to
1900 ± 1000 nucleotides, preferably from about 1900 ± 50 to 1900 ± 700 and
more preferably from about 1900 ± 100 to 1900 ± 500 nucleotides, whereby the
"basic number" of 1900 nucleotides corresponds in this case to the encoding
nucleotide sequence of the PDAT enzyme of SEQ ID NO. 1. Consequently, the
"basic number" of the functional full-length gene can vary.
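
The "basic number ± tolerance" notation used for fragment and portion lengths can be expanded into explicit minimum and maximum lengths. The short sketch below does this for the basic numbers of 660 amino acids and 1900 nucleotides and the tolerances named in the text; the function and variable names are hypothetical and serve only as an illustration of the arithmetic.

```python
def length_range(base: int, tolerance: int) -> tuple[int, int]:
    """Return (minimum, maximum) for a length written as 'base ± tolerance'."""
    if tolerance < 0 or tolerance > base:
        raise ValueError("tolerance must lie between 0 and the basic number")
    return base - tolerance, base + tolerance


AA_BASE = 660   # amino acids, basic number for the enzyme of SEQ ID NO. 2
NT_BASE = 1900  # nucleotides, basic number for the sequence of SEQ ID NO. 1

# Tolerances as stated in the text for fragments and portions respectively.
aa_ranges = {t: length_range(AA_BASE, t) for t in (10, 50, 100, 250)}
nt_ranges = {t: length_range(NT_BASE, t) for t in (10, 50, 100, 500, 700, 1000)}

for tol, (low, high) in aa_ranges.items():
    print(f"660 ± {tol}: {low} to {high} amino acids")
for tol, (low, high) in nt_ranges.items():
    print(f"1900 ± {tol}: {low} to {high} nucleotides")
```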

An allelic variant of the instant nucleotide sequence is understood to be any
different nucleotide sequence which encodes a polypeptide with a functionally
equivalent function. The alleles include naturally occurring variants of the instant
nucleotide sequences as well as synthetic nucleotide sequences produced by
methods known in the art. Contemplated are even altered nucleotide
sequences which result in an enzyme with altered activity and/or regulation or
which is resistant against specific inhibitors. The instant invention further
includes natural or synthetic mutations of the originally isolated nucleotide
sequences. These mutations can be substitution, addition, deletion, inversion
or insertion of one or more nucleotides.

A homologous nucleotide sequence is understood to be a complementary
sequence and/or a sequence which specifically hybridizes with the instant
nucleotide sequence. Hybridizing sequences include similar sequences
selected from the group of DNA or RNA which specifically interact with the instant
nucleotide sequences under at least moderate stringency conditions which are
known in the art. A preferred, non-limiting example of stringent hybridization
conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at
about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at
50-65°C. This further includes short nucleotide sequences of e.g. 10 to 30
nucleotides, preferably 12 to 15 nucleotides. Included are also primers or
hybridization probes.


A homologous nucleotide sequence included within the scope of the instant
invention is a sequence which is at least about 40%, preferably at least about
50% or 60%, and more preferably at least about 70%, 80% or 90% and most
preferably at least about 95%, 96%, 97%, 98% or 99% or more homologous to
a nucleotide sequence of SEQ ID NO. 1.
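
The text does not specify how the percentage of homology is calculated. One common way of putting a number on it is percent identity over an existing pairwise alignment, and the following minimal sketch takes that approach as an assumption. The two aligned sequences are hypothetical, the alignment itself is taken as given (gaps written as "-"), and the function names are illustrative only.

```python
# Thresholds named in the text, in percent.
THRESHOLDS = (40, 50, 60, 70, 80, 90, 95, 96, 97, 98, 99)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped alignment columns carrying identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # columns containing a gap are not counted here
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def thresholds_met(identity: float) -> list[int]:
    """List every threshold from the text that the identity value reaches."""
    return [t for t in THRESHOLDS if identity >= t]


# Hypothetical pre-aligned nucleotide fragments, invented for illustration.
seq_a = "ATGGCTAAGT-CCA"
seq_b = "ATGACTAAGTTCCG"
identity = percent_identity(seq_a, seq_b)
print(f"{identity:.1f}% identical; thresholds met: {thresholds_met(identity)}")
```

Whether gap columns count against the score is a design choice; this sketch ignores them, which gives a higher value than dividing by the full alignment length.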

All of the aforementioned definitions are true for amino acid sequences and
functional enzymes and can easily be transferred by a person skilled in the art.

Isoenzymes are understood to be enzymes which have the same or a similar
substrate specificity and/or catalytic activity but a different primary structure.

In a first embodiment, this invention is directed to nucleic acid sequences that
encode a PDAT. This includes sequences that encode biologically active
PDATs as well as sequences that are to be used as probes, vectors for
transformation or cloning intermediates. The PDAT encoding sequence may
encode a complete or partial sequence depending upon the intended use. All or
a portion of the genomic sequence, cDNA sequence, precursor PDAT or
mature PDAT is intended.

Further included is a nucleotide sequence selected from the group consisting of
sequences set forth in SEQ ID NO. 1, 1b, 3, 3b, 4, 4a, 4b, 9b, 10, 10b or 11 or a
portion, derivate, allele or homolog thereof. The invention pertains to a partial
nucleotide sequence corresponding to a full-length nucleotide sequence selected
from the group consisting of sequences set forth in SEQ ID NO. 5, 5b, 6b, 7, 8b,
9, 11b or 12 or a portion, derivate, allele or homolog thereof. Moreover, a
nucleotide sequence comprising a nucleotide sequence which is at least 40%
homologous to a nucleotide sequence selected from the group consisting of
those sequences set forth in SEQ ID NO. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b,
9, 9b, 10, 10b, 11, 11b or 12 is contemplated within the scope of the invention.


The instant invention pertains to a gene construct comprising a said nucleotide
sequence of the instant invention which is operably linked to a heterologous
nucleic acid.

The term operably linked means a serial organisation e.g. of a promoter, coding
sequence, terminator and/or further regulatory elements whereby each element
can fulfill its original function during expression of the nucleotide sequence.

Further, a vector comprising a said nucleotide sequence of the instant
invention is contemplated in the instant invention. This includes also an
expression vector as well as a vector further comprising a selectable marker
gene and/or nucleotide sequences for the replication in a host cell and/or the
integration into the genome of the host cell.

In a different aspect, this invention relates to a method for producing a PDAT in
a host cell or progeny thereof, including genetically engineered oil seeds, yeast
and moulds or any other oil accumulating organism, via the expression of a
construct in the cell. Cells containing a PDAT as a result of the production of
the PDAT encoding sequence are also contemplated within the scope of the
invention.

Further, the invention pertains to a transgenic cell or organism containing a said
nucleotide sequence and/or a said gene construct and/or a said vector. The
object of the instant invention is further a transgenic cell or organism which is
a eukaryotic cell or organism. Preferably, the transgenic cell or organism is a
yeast cell or a plant cell or a plant. The instant invention further pertains to said
transgenic cell or organism having an altered biosynthetic pathway for the
production of triacylglycerol. A transgenic cell or organism having an altered oil
content is also contemplated within the scope of this invention.

Further, the invention pertains to a transgenic cell or organism wherein the activity
of PDAT is altered in said cell or organism. This altered activity of PDAT is
characterized by an alteration in gene expression, catalytic activity and/or
regulation of activity of the enzyme. Moreover, a transgenic cell or organism is
included in the instant invention, wherein the altered biosynthetic pathway for
the production of triacylglycerol is characterized by the prevention of
accumulation of undesirable fatty acids in the membrane lipids.

In a different embodiment, this invention also relates to methods of using a 
DNA sequence encoding a PDAT for increasing the oil-content within a cell. 

Another aspect of the invention relates to the accommodation of high amounts
of uncommon fatty acids in the triacylglycerol produced within a cell, by
introducing a DNA sequence producing a PDAT that specifically removes these
fatty acids from the membrane lipids of the cell and channels them into
triacylglycerol. Plant cells having such a modification are also contemplated
herein.

Further, the invention pertains to a process for the production of triacylglycerol,
comprising growing a said transgenic cell or organism under conditions
whereby the said nucleotide sequence is expressed and whereby the said
transgenic cells comprise a said enzyme catalysing the transfer of fatty acids
from phospholipids to diacylglycerol, forming triacylglycerol.

Moreover, triacylglycerols produced by the aforementioned process are
included in the scope of the instant invention.

An object of the instant invention is further the use of an instant nucleotide
sequence and/or a said enzyme for the production of triacylglycerol and/or
triacylglycerols with uncommon fatty acids. The use of a said instant nucleotide
sequence and/or a said enzyme of the instant invention for the transformation
of any cell or organism in order to be expressed in this cell or organism and
result in an altered, preferably increased, oil content of this cell or organism is
also contemplated within the scope of the instant invention.

A PDAT of this invention includes any sequence of amino acids, such as a
protein, polypeptide or peptide fragment obtainable from a microorganism,
animal or plant source that demonstrates the ability to catalyse the production
of triacylglycerol from a phospholipid and diacylglycerol under enzyme reactive
conditions. By "enzyme reactive conditions" is meant that any necessary
conditions are available in an environment (e.g., such factors as temperature,
pH, lack of inhibiting substances) which will permit the enzyme to function.


Other PDATs are obtainable from the specific sequences provided herein.
Furthermore, it will be apparent that one can obtain natural and synthetic
PDATs, including modified amino acid sequences and starting materials for
synthetic-protein modelling from the exemplified PDATs and from PDATs which
are obtained through the use of such exemplified sequences. Modified amino
acid sequences include sequences that have been mutated, truncated,
increased and the like, whether such sequences were partially or wholly
synthesised. Sequences that are actually purified from plant preparations or
are identical or encode identical proteins thereto, regardless of the method
used to obtain the protein or sequence, are equally considered naturally
derived.

Further, the nucleic acid probes (DNA and RNA) of the present invention can
be used to screen and recover "homologous" or "related" PDATs from a variety
of plant and microbial sources.

Further, it is also apparent that a person skilled in the art can, with the
information provided in this application, in any organism identify a PDAT
activity, purify an enzyme with this activity and thereby identify a
"non-homologous" nucleic acid sequence encoding such an enzyme.


The present invention can be essentially characterized by the following 
aspects: 

1. Use of a PDAT gene (genomic clone or cDNA) for transformation.

2. Use of a DNA molecule according to item 1 wherein said DNA is used for
transformation of any organism in order to be expressed in this organism
and result in an active recombinant PDAT enzyme in order to increase oil
content of the organism.

3. Use of a DNA molecule of item 1 wherein said DNA is used for
transformation of any organism in order to prevent the accumulation of
undesirable fatty acids in the membrane lipids.

4. Use according to item 1, wherein said PDAT gene is used for transforming
transgenic oil accumulating organisms engineered to produce any
uncommon fatty acid which is harmful if present in high amounts in
membrane lipids, such as medium chain fatty acids, hydroxylated fatty
acids, epoxygenated fatty acids and acetylenic fatty acids.

5. Use according to item 1, wherein said PDAT gene is used for transforming
organisms, and wherein said organisms are crossed with other oil
accumulating organisms engineered to produce any uncommon fatty acid
which is harmful if present in high amounts in membrane lipids, comprising
medium chain fatty acids, hydroxylated fatty acids, epoxygenated fatty
acids and acetylenic fatty acids.

6. Use according to item 1, wherein the enzyme encoded by said PDAT gene
or cDNA is a PDAT with distinct acyl specificity.

7. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 30% or more identical to the amino acid
sequence of PDAT as presented in SEQ ID NO. 2.

8. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 40% or more identical to the amino acid
sequence of PDAT as presented in SEQ ID NO. 2.

9. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 60% or more identical to the amino acid
sequence of PDAT as presented in SEQ ID NO. 2.

10. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from Saccharomyces cerevisiae, or contains nucleotide sequences
coding for an amino acid sequence 80% or more identical to the amino acid
sequence of PDAT as presented in SEQ ID NO. 2.

11. Use according to item 1 wherein said PDAT encoding gene or cDNA is
derived from plants or contains nucleotide sequences coding for an amino
acid sequence 40% or more identical to the amino acid sequence of PDAT
from Arabidopsis thaliana or to the protein encoded by the full-length
counterpart of the partial Zea mays, Lycopersicon esculentum, or
Neurospora crassa cDNA clones.

12. Transgenic oil accumulating organisms comprising, in their genome, a
PDAT gene transferred by recombinant DNA technology or somatic
hybridization.

13. Transgenic oil accumulating organisms according to item 12 comprising, in
their genome, a PDAT gene having specificity for substrates with a
particular uncommon fatty acid and the gene for said uncommon fatty acid.

14. Transgenic organisms according to item 12 or 13 which are selected from
the group consisting of fungi, plants and animals.

15. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants.

16. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants and where said PDAT gene is expressed
under the control of a storage organ specific promoter.

17. Transgenic organisms according to item 12 or 13 which are selected from
the group of agricultural plants and where said PDAT gene is expressed
under the control of a seed promoter.

18. Oils from organisms according to items 12-17.

19. A method for altering acyl specificity of a PDAT by alteration of the
nucleotide sequence of a naturally occurring encoding gene and as a
consequence of this alteration creating a gene encoding an enzyme
with novel acyl specificity.

20. A protein encoded by a DNA molecule according to item 1 or a functional
fragment thereof.

21. A protein of item 20 designated phospholipid:diacylglycerol acyltransferase.

22. A protein of item 21 which has a distinct acyl specificity.

23. A protein of item 13 having the amino acid sequence as set forth in SEQ
ID NO. 2, 13, 14 or 15 (and the proteins encoded by the full-length or partial
genes set forth in SEQ ID NO. 1, 3, 4, 5, 7, 9, 10, 11 or 12) or an amino
acid sequence with at least 30% homology to said amino acid sequence.

24. A protein of item 23 isolated from Saccharomyces cerevisiae.

General methods:

Yeast strains and plasmids. The wild type yeast strains used were either
FY1679 (MATα his3-Δ200 leu2-Δ1 trp1-Δ6 ura3-52) or W303-1A (MATa ADE2-1
can1-100 his3-11,15 leu2-3,112 trp1-1 ura3-1) (7). The YNR008w::KanMX2
disruption strain FVKT004-04C(AL), which is congenic to FY1679, was
obtained from the Euroscarf collection (8). A 2751 bp fragment containing the
YNR008w gene with 583 bp of 5' and 183 bp of 3' flanking DNA was amplified
from W303-1A genomic DNA using Taq polymerase with
5'-TCTCCATCTTCTGCAAAACCT-3' and 5'-CCTGTCAAAAACCTTCTCCTC-3' as
primers. The resulting PCR product was purified by agarose gel electrophoresis
and cloned into the EcoRV site of pBluescript (pBluescript-pdat). For
complementation experiments, the cloned fragment was released from
pBluescript by HindIII-SacI digestion and then cloned between the HindIII and
SacI sites of pFL39 (9), thus generating pUS1. For overexpression of the PDAT
gene, a 2202 bp EcoRI fragment from the pBluescript plasmid which contains
only 24 bp of 5' flanking DNA was cloned into the BamHI site of the
GAL1-TPK2 expression vector pJN92 (12), thus generating pUS4.
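
Basic properties of the two PCR primers given above can be checked with a few lines of code. The sketch below reports the length, GC fraction and reverse complement of each primer. The names PRIMER_1 and PRIMER_2 and the helper functions are hypothetical labels for illustration, since the text does not state which primer anneals to which strand.

```python
# Primer sequences as given in the text, written 5' to 3'.
PRIMER_1 = "TCTCCATCTTCTGCAAAACCT"
PRIMER_2 = "CCTGTCAAAAACCTTCTCCTC"

# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in the sequence that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


for name, primer in (("primer 1", PRIMER_1), ("primer 2", PRIMER_2)):
    print(
        f"{name}: {len(primer)} nt, GC {100 * gc_fraction(primer):.1f}%, "
        f"reverse complement 5'-{reverse_complement(primer)}-3'"
    )
```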

Microsomal preparations. Microsomes from developing seeds of sunflower
(Helianthus annuus), Ricinus communis and Crepis palaestina were prepared
using the procedure of Stobart and Stymne (11). To obtain yeast microsomes,
1 g of yeast cells (fresh weight) was re-suspended in 8 ml of ice-cold buffer (20
mM Tris-Cl, pH 7.9, 10 mM MgCl2, 1 mM EDTA, 5% (v/v) glycerol, 1 mM DTT,
0.3 M ammonium sulfate) in a 12 ml glass tube. To this tube, 4 ml of glass
beads (diameter 0.45-0.5 mm) were added, and the tube was then heavily
shaken (3 x 60 s) in an MSK cell homogenizer (B. Braun Melsungen AG,
Germany). The homogenized suspension was centrifuged at 20,000 x g for 15
min at 6°C and the resulting supernatant was again centrifuged at 100,000 x g
for 2 h at 6°C. The 100,000 x g pellet was resuspended in 0.1 M potassium
phosphate (pH 7.2), and stored at -80°C. It is subsequently referred to as the
crude yeast microsomal fraction.



Lipid substrates. Radio-labeled ricinoleic (12-hydroxy-9-octadecenoic) and
vernolic (12,13-epoxy-9-octadecenoic) acids were synthesized enzymatically
from [1-14C]oleic acid and [1-14C]linoleic acid, respectively, by incubation with
microsomal preparations from seeds of Ricinus communis and Crepis
palaestina, respectively (12). The synthesis of phosphatidylcholines (PC) or
phosphatidylethanolamines (PE) with 14C-labeled acyl groups in the sn-2
position was performed using either enzymatic (13) or synthetic (14) acylation
of [14C]oleic, [14C]ricinoleic, or [14C]vernolic acid. Dioleoyl-PC that was labeled
in the sn-1 position was synthesized from sn-1-[14C]oleoyl-lyso-PC and
unlabeled oleic acid as described in (14). sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG
was synthesized from PC by the action of phospholipase C type XI from B.
cereus (Sigma Chemical Co.) as described in (15). Monovernoloyl- and
divernoloyl-DAG were synthesized from TAG extracted from seeds of
Euphorbia lagascae, using the TAG-lipase (Rhizopus arrhizus, Sigma Chemical
Co.) as previously described (16). Monoricinoleoyl-TAG was synthesized
according to the same method using TAG extracted from castor bean.


Lipid analysis. The total lipid composition of yeast was determined from cells
harvested from a 40 ml liquid culture, broken in a glass-bead shaker and
extracted into chloroform as described by Bligh and Dyer (17), and then
separated by thin layer chromatography in hexane/diethylether/acetic acid
(80:20:1) using pre-coated silica gel 60 plates (Merck). The lipid areas were
located by brief exposure to I2 vapors and identified by means of appropriate
standards. Polar lipids, sterol-esters and triacylglycerols, as well as the
remaining minor lipid classes, referred to as 'other lipids', were excised from the
plates. Fatty acid methylesters were prepared by heating the dry excised
material at 85 °C for 60 min in 2% (v/v) sulfuric acid in dry methanol. The
methyl esters were extracted with hexane and analyzed by GLC through a 50 m
x 0.32 mm CP-Wax58-CB fused-silica column (Chrompack), with
methylheptadecanoic acid as an internal standard. The fatty acid content of
each fraction was quantified and used to calculate the relative amount of each
lipid class. In order to determine the total lipid content, 3 ml aliquots from yeast
cultures were harvested by centrifugation and the resulting pellets were washed
with distilled water and lyophilized. The weight of the dried cells was
determined and the fatty acid content was quantified by GLC analyses after
conversion to methylesters as described above. The lipid content was then
calculated as nmol fatty acid (FA) per mg dry weight yeast.
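
The final calculation, nmol fatty acid per mg dry weight, can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: it assumes that each fatty acid methyl ester gives the same detector response per mole as the internal standard, and all peak areas, the amount of standard and the dry weight are hypothetical values invented for the example.

```python
def lipid_content_nmol_per_mg(
    peak_areas: dict[str, float],
    standard_area: float,
    standard_nmol: float,
    dry_weight_mg: float,
) -> float:
    """Total fatty acid content in nmol per mg dry weight.

    Assumption (not from the text): every fatty acid methyl ester has the
    same molar detector response as the internal standard, so that
    nmol = peak area / standard area * nmol of standard added.
    """
    if standard_area <= 0 or dry_weight_mg <= 0:
        raise ValueError("standard area and dry weight must be positive")
    total_nmol = sum(
        area / standard_area * standard_nmol for area in peak_areas.values()
    )
    return total_nmol / dry_weight_mg


# Hypothetical GLC peak areas (arbitrary units) for one yeast sample.
areas = {"16:0": 200.0, "16:1": 600.0, "18:0": 100.0, "18:1": 500.0}

content = lipid_content_nmol_per_mg(
    areas,
    standard_area=100.0,  # hypothetical area of the internal standard peak
    standard_nmol=50.0,   # hypothetical amount of internal standard added
    dry_weight_mg=3.5,    # hypothetical dry weight of the cell pellet
)
print(f"{content:.1f} nmol FA per mg dry weight")
```

In practice, response factors for individual fatty acids may differ, in which case a per-compound correction would replace the single ratio used here.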



Enzyme assays. Aliquots of crude microsomal fractions (corresponding to
10 nmol of microsomal PC) from developing plant seeds or yeast cells were
lyophilized overnight. 14C-labeled substrate lipids dissolved in benzene were
then added to the dried microsomes. The benzene was evaporated under a
stream of N2, leaving the lipids in direct contact with the membranes, and 0.1
ml of 50 mM potassium phosphate (pH 7.2) was added. The suspension was
thoroughly mixed and incubated at 30°C for the time period indicated, up to 90
min. Lipids were extracted from the reaction mixture using chloroform and
separated by thin layer chromatography in hexane/diethylether/acetic acid
(35:70:1.5) using silica gel 60 plates (Merck). The radioactive lipids were
visualized and quantified on the plates by electronic autoradiography (Instant
Imager, Packard, US).

Yeast cultivation. Yeast cells were grown at 28°C on a rotary shaker in
liquid YPD medium (1% yeast extract, 2% peptone, 2% glucose), synthetic
medium (18) containing 2% (v/v) glycerol and 2% (v/v) ethanol, or minimal
medium (19) containing 16 g/l of glycerol.

The instant invention is further characterized by the following examples which
are not limiting:



Acyl-CoA-independent synthesis of TAG by oil seed microsomes. A large
number of unusual fatty acids can be found in oil seeds (20). Many of these
fatty acids, such as ricinoleic (21) and vernolic acids (22), are synthesized
using phosphatidylcholine (PC) with oleoyl or linoleoyl groups esterified to the
sn-2 position, respectively, as the immediate precursor. However, even though
PC can be a substrate for unusual fatty acid synthesis and is the major
membrane lipid in seeds, unusual fatty acids are rarely found in the
membranes. Instead, they are mainly incorporated into the TAG. A mechanism
for efficient and selective transfer of these unusual acyl groups from PC into
TAG must therefore exist in oil seeds that accumulate such unusual fatty acids.
This transfer reaction was biochemically characterized in seeds from castor
bean (Ricinus communis) and Crepis palaestina, plants which accumulate high
levels of ricinoleic and vernolic acid, respectively, and sunflower (Helianthus
annuus), a plant which has only common fatty acids in its seed oil. Crude
microsomal fractions from developing seeds were incubated with PC having
14C-labeled oleoyl, ricinoleoyl or vernoloyl groups at the sn-2 position. After the
incubation, lipids were extracted and analyzed by thin layer chromatography.
We found that the amount of radioactivity that was incorporated into the neutral
lipid fraction increased linearly over a period of 4 hours (data not shown). The
distribution of [14C]acyl groups within the neutral lipid fraction was analyzed
after 80 min (Fig. 1). Interestingly, the amount and distribution of radioactivity
between different neutral lipids were strongly dependent both on the plant
species and on the type of [14C]acyl chain. Thus, sunflower microsomes
incorporated most of the label into DAG, regardless of the type of [14C]acyl
group. In contrast, R. communis microsomes preferentially incorporated
[14C]ricinoleoyl and [14C]vernoloyl groups into TAG, while [14C]oleoyl groups
were mostly found in DAG. C. palaestina microsomes, finally, incorporated only
[14C]vernoloyl groups into TAG, with [14C]ricinoleoyl groups being found mostly
as free fatty acids, and [14C]oleoyl groups in DAG. This shows that the high in
vivo levels of ricinoleic acid and vernolic acid in the TAG pool of R. communis
and C. palaestina, respectively, can be explained by an efficient and selective
transfer of the corresponding acyl groups from PC to TAG in these organisms.

The in vitro synthesis of triacylglycerols in microsomal preparations of
developing castor bean is summarized in Table 1.

PDAT: a novel enzyme that catalyzes acyl-CoA-independent synthesis of
TAG. We investigated whether DAG could serve both as an acyl donor and as
an acyl acceptor in the reactions catalyzed by the oil seed microsomes.
Therefore, unlabeled divernoloyl-DAG was incubated with either
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG or sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC
in the presence of R. communis microsomes. The synthesis of TAG molecules
containing both [14C]ricinoleoyl and vernoloyl groups was 5-fold higher when
[14C]ricinoleoyl-PC served as acyl donor than when [14C]ricinoleoyl-DAG did
(Fig. 1B). These data strongly suggest that PC is the immediate acyl donor and
DAG the acyl acceptor in the acyl-CoA-independent formation of TAG by oil
seed microsomes. This reaction is therefore catalyzed by a new enzyme, which
we call phospholipid:diacylglycerol acyltransferase (PDAT).

PDAT activity in yeast microsomes. Wild type yeast cells were cultivated
under conditions where TAG synthesis is induced. Microsomal membranes
were prepared from these cells and incubated with sn-2-[14C]ricinoleoyl-PC
and DAG, and the 14C-labeled products formed were analyzed. The PC-derived
[14C]ricinoleoyl groups within the neutral lipid fraction were mainly found in
free fatty acids or TAG, and the amount of TAG synthesized was
dependent on the amount of DAG that was added to the reaction (Fig. 2). The in
vitro synthesis of TAG containing both ricinoleoyl and vernoloyl groups, a TAG
species not present in vivo, from exogenously added sn-2-[14C]ricinoleoyl-PC and
unlabelled vernoloyl-DAG (Fig. 2, lane 3) clearly demonstrates the existence of
an acyl-CoA-independent synthesis of TAG involving PC and DAG as
substrates in yeast microsomal membranes. Consequently, TAG synthesis in
yeast can be catalyzed by an enzyme similar to the PDAT found in plants.

The PDAT encoding gene in yeast

A gene in the yeast genome (YNR008w) is known, but nothing is known about
the function of YNR008w, except that the gene is not essential for growth under
normal circumstances. Microsomal membranes were prepared from the yeast
strain FVKT004-04C(AL) (8), in which this gene with unknown function had been
disrupted. PDAT activity in the microsomes was assayed using PC with
radiolabeled fatty acids at the sn-2 position. The activity was found to be
completely absent in the disruption strain (Fig. 2, lane 4). Significantly, the
activity could be partially restored by the presence of YNR008w on the single
copy plasmid pUS1 (Fig. 2, lane 5). Moreover, acyl groups of
phosphatidylethanolamine (PE) were efficiently incorporated into TAG by
microsomes from the wild type strain, whereas no incorporation occurred from
this substrate in the mutant strain (data not shown). This shows that YNR008w
encodes a yeast PDAT which catalyzes the transfer of an acyl group from the
sn-2 position of phospholipids to DAG, thus forming TAG. It should be noted
that no cholesterol esters were formed from radioactive PC even in incubations
with added ergosterols, nor was the amount of radioactive free fatty acids
formed from PC affected by disruption of the YNR008w gene (data not shown).
This demonstrates that yeast PDAT does not have cholesterol ester synthesising
or phospholipase activities.

Increased TAG content in yeast cells that overexpress PDAT. The effect
of overexpressing the PDAT-encoding gene was studied by transforming a wild
type yeast strain with the pUS4 plasmid, in which the gene is expressed from
the galactose-induced GAL1-TPK2 promoter. Cells containing the empty
expression vector were used as a control. The cells were grown in synthetic
glycerol-ethanol medium, and expression of the gene was induced after either 2
hours (early log phase) or 25 hours (stationary phase) by the addition of
galactose. The cells were then incubated for another 21 hours, after which they
were harvested and assays were performed. We found that overexpression of
PDAT had no significant effect on the growth rate as determined by the optical
density. However, the total lipid content, measured as µmol fatty acids per mg
yeast dry weight, was 47% (log phase) or 29% (stationary phase) higher in the
PDAT overexpressing strain than in the control. Furthermore, the polar lipid and
sterol ester content was unaffected by overexpression of PDAT. Instead, the
elevated lipid content in these cells is entirely due to an increased TAG content
(Fig. 3A, B). Thus, the amount of TAG was increased 2-fold in PDAT
overexpressing early log phase cells and by 40% in stationary phase cells. It is
interesting to note that a significant increase in the TAG content was achieved
by overexpressing PDAT even under conditions (i.e. in stationary phase) where
DAGAT is induced and thus contributes significantly to TAG synthesis. In vitro
PDAT activity assayed in microsomes from the PDAT overexpressing strain
was 7-fold higher than in the control strain, a finding which is consistent with the
increased levels of TAG that we observed in vivo (Fig. 3C). These results
clearly demonstrate the potential use of the PDAT gene in increasing the oil
content in transgenic organisms.
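
The percentages quoted above permit a simple consistency check. The following is a minimal sketch rather than part of the original analysis: it assumes that only the TAG pool changes on overexpression (as stated above) and that "2-fold" means a 100% increase; the function names are hypothetical. Under those assumptions, the rise in total lipid equals the rise in TAG, so the ratio of the two percentage increases gives the share of the control lipid pool that TAG would have to represent.

```python
def percent_increase(treated: float, control: float) -> float:
    """Relative increase of treated over control, in percent."""
    return (treated - control) / control * 100.0


def implied_tag_fraction(total_increase_pct: float, tag_increase_pct: float) -> float:
    """TAG share of total control lipid implied when the whole rise in
    total lipid is due to TAG: total_c * p_total = tag_c * p_tag."""
    return total_increase_pct / tag_increase_pct


# Percentages quoted in the text; a 2-fold TAG level is taken as +100%.
log_phase = implied_tag_fraction(47.0, percent_increase(2.0, 1.0))
stationary = implied_tag_fraction(29.0, 40.0)
print(f"implied TAG share of control lipid, log phase: {log_phase:.3f}")
print(f"implied TAG share of control lipid, stationary phase: {stationary:.3f}")
```

The implied shares are only as reliable as the rounded percentages they are derived from; the measured values are those shown in Fig. 3.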

Substrate specificity of yeast PDAT. The substrate specificity of yeast
PDAT was analyzed using microsomes prepared from the PDAT
overexpressing strain (see Fig. 4). The rate of TAG synthesis, under the conditions
given in Figure 4 with di-oleoyl-PC as the acyl donor, was 0.15 nmol per min
and mg protein. With both oleoyl groups of PC labeled it was possible, under
the given assay conditions, to detect the transfer of 11 pmol/min of [14C]oleoyl
chain into TAG and the formation of 15 pmol/min of lyso-PC. In microsomes
from the PDAT-deficient strain, no TAG at all and only trace amounts of
lyso-PC were detected, strongly suggesting that yeast PDAT catalyses the formation
of equimolar amounts of TAG and lyso-PC when supplied with PC and DAG as
substrates. The fact that somewhat more lyso-PC than TAG is formed can be
explained by the presence of a phospholipase in yeast microsomes, which
produces lyso-PC and unesterified fatty acids from PC.
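
The stoichiometric argument can be written out as a short calculation. This is an illustrative sketch only, with hypothetical function and variable names; it assumes that PDAT forms TAG and lyso-PC in a strict 1:1 ratio, so that any lyso-PC formed beyond the TAG rate is attributed to the phospholipase side reaction mentioned above.

```python
def lyso_pc_excess(tag_rate: float, lyso_pc_rate: float) -> float:
    """Lyso-PC formation not accounted for by a 1:1 PDAT reaction (same units as inputs)."""
    return lyso_pc_rate - tag_rate


# Rates quoted in the text, in pmol/min.
tag_rate = 11.0
lyso_pc_rate = 15.0

excess = lyso_pc_excess(tag_rate, lyso_pc_rate)
ratio = lyso_pc_rate / tag_rate
print(f"lyso-PC not explained by PDAT: {excess:.1f} pmol/min")
print(f"lyso-PC : TAG ratio: {ratio:.2f}")
```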

The specificity of yeast PDAT for different acyl group positions was
investigated by incubating the microsomes with di-oleoyl-PC carrying a
[14C]acyl group either at the sn-1 position (Fig. 4A, bar 2) or the sn-2 position
(Fig. 4A, bar 3). We found that the major 14C-labeled product formed in the
former case was lyso-PC, and in the latter case TAG. We conclude that yeast
PDAT has a specificity for the transfer of acyl groups from the sn-2 position of
the phospholipid to DAG, thus forming sn-1-lyso-PC and TAG. Under the given
assay conditions, trace amounts of 14C-labelled DAG are formed from the sn-1
labeled PC by the reversible action of a CDP-choline:choline
phosphotransferase. This labeled DAG can then be further converted into TAG
by the PDAT activity. It is therefore not possible to distinguish whether the
minor amounts of labeled TAG that are formed in the presence of di-oleoyl-PC
carrying a [14C]acyl group in the sn-1 position are synthesized directly from the
sn-1-labeled PC by a PDAT that can also act on the sn-1 position, or whether the
PC is first converted to sn-1-labeled DAG and then acylated by a PDAT with strict
selectivity for the transfer of acyl groups at the sn-2 position of PC. Taken
together, this shows that the PDAT encoded by YNR008w catalyses an acyl
transfer from the sn-2 position of PC to DAG, thus causing the formation of
TAG and lyso-PC.

The substrate specificity of yeast PDAT was further analyzed with respect
to the headgroup of the acyl donor, the acyl group transferred and the acyl
chains of the acceptor DAG molecule. The two major membrane lipids of S.
cerevisiae are PC and PE, and as shown in Fig. 4B (bars 1 and 2), dioleoyl-PE
is nearly 4-fold more efficient than dioleoyl-PC as acyl donor in the
PDAT-catalyzed reaction. Moreover, the rate of acyl transfer is strongly dependent on
the type of acyl group that is transferred. Thus, a ricinoleoyl group at the sn-2
position of PC is 2.5 times more efficiently transferred into TAG than an oleoyl

WO 00/60095                                                  PCT/EP00/02701

group in the same position (Fig. 4B, bars 1 and 3). In contrast, yeast PDAT has
no preference for the transfer of vernoloyl groups over oleoyl groups (Fig. 4B,
bars 1 and 4). The acyl chain of the acceptor DAG molecule also affects the
efficiency of the reaction. Thus, DAG with a ricinoleoyl or a vernoloyl group is a
more efficient acyl acceptor than dioleoyl-DAG (Fig. 4B, bars 1, 5 and 6). Taken
together, these results clearly show that the efficiency of the PDAT-catalyzed
acyl transfer is strongly dependent on the properties of the substrate lipids.

PDAT genes. Nucleotide and amino acid sequences of several PDAT
genes are given as SEQ ID NO. 1 through 15. Further provisional and/or partial
sequences are given as SEQ ID NO. 1a through 5a and 1b through 11b,
respectively. One of the Arabidopsis genomic sequences (SEQ ID NO. 4)
identified an Arabidopsis EST cDNA clone, T04806. This cDNA clone was fully
characterised and the nucleotide sequence is given as SEQ ID NO. 5. Based
on the sequence homology of the T04806 cDNA and the Arabidopsis thaliana
genomic DNA sequence (SEQ ID NO. 4) it is apparent that an additional A is
present at position 417 in the cDNA clone (data not shown). Excluding this
nucleotide would give the amino acid sequence depicted in SEQ ID NO. 12.
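
The effect of excluding a single inserted nucleotide before translation can be illustrated as follows. This is a minimal sketch using a short made-up sequence, not the T04806 clone; the helper names are hypothetical and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def delete_base(seq: str, position: int) -> str:
    """Remove the nucleotide at a 1-based position."""
    return seq[: position - 1] + seq[position:]


def translate(seq: str) -> str:
    """Translate in frame 1; a trailing incomplete codon is ignored."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i : i + 3]] for i in range(0, usable, 3))


# Toy example: an extra A at position 7 shifts the reading frame downstream.
with_insertion = "ATGGCTATGGAAA"
print(translate(with_insertion))                  # frame-shifted peptide
print(translate(delete_base(with_insertion, 7)))  # peptide after removing the extra base
```

For the cDNA clone discussed above, the same operation would be applied at position 417 of the clone sequence.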

Increased TAG content in seeds of Arabidopsis thaliana that express the
yeast PDAT. For the expression of the yeast PDAT gene in Arabidopsis
thaliana, an EcoRI fragment from the pBluescript-PDAT was cloned together
with the napin promoter (25) into the vector pGPTV-KAN (26). A plasmid
(pGNapPDAT) having the yeast PDAT gene in the correct orientation was
identified and transformed into Agrobacterium tumefaciens. These bacteria
were used to transform Arabidopsis thaliana Columbia (C-24) plants using the
root transformation method (27). Plants transformed with an empty vector were
used as controls.

First generation seeds (T1) were harvested and germinated on
kanamycin-containing medium. Second generation seeds (T2) were pooled from individual
plants and their fatty acid contents analysed by quantification of their methyl
esters by gas liquid chromatography after methylation of the seeds with 2%
sulphuric acid in methanol at 85 °C for 1.5 hours. Quantification was done with
heptadecanoic acid methyl esters as internal standard.
From the transformation with pGNapPDAT, one T1 plant (26-14) gave rise to
seven T2 plants, of which 3 plants yielded seeds with statistically significantly
(in a mean difference two-sided test) higher oil content than seeds from 12 plants
generated from T1 plant 32-4 transformed with an empty vector (Table 2).
Description of Figures

FIG. 1:

Metabolism of 14C-labeled PC into the neutral lipid fraction by plant
microsomes. (A) Microsomes from developing seeds of sunflower, R.
communis and C. palaestina were incubated for 80 min at 30°C with PC (8
nmol) having oleic acid in its sn-1 position, and either 14C-labeled oleic,
ricinoleic or vernolic acid in its sn-2 position. Radioactivity incorporated in TAG
(open bars), DAG (solid bars), and unesterified fatty acids (hatched bars) was
quantified using thin layer chromatography followed by electronic
autoradiography, and is shown as percentage of added labeled substrate. (B)
Synthesis in vitro of TAG carrying two vernoloyl and one [14C]ricinoleoyl group
by microsomes from R. communis. The substrates added were unlabeled
divernoloyl-DAG (5 nmol), together with either
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-DAG (0.4 nmol, 7700 dpm/nmol) or
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC (0.4 nmol, 7700 dpm/nmol). The microsomes
were incubated with the substrates for
30 min at 30°C, after which samples were removed for lipid analysis as
described in the section "General methods". The data shown are the average of
two experiments.
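
The conversion from measured radioactivity to "percentage of added labeled substrate" can be sketched as follows. This is an illustration only: the specific activity (7700 dpm/nmol) and the amount of substrate (0.4 nmol) are taken from the legend above, whereas the dpm reading and the function names are hypothetical.

```python
def nmol_from_dpm(dpm: float, specific_activity_dpm_per_nmol: float) -> float:
    """Convert a radioactivity reading into nmol of labeled acyl groups."""
    return dpm / specific_activity_dpm_per_nmol


def percent_of_added(dpm: float, nmol_added: float, specific_activity_dpm_per_nmol: float) -> float:
    """Label recovered in one lipid class as a percentage of the label added."""
    return nmol_from_dpm(dpm, specific_activity_dpm_per_nmol) / nmol_added * 100.0


# Hypothetical reading of 770 dpm in one TLC band.
recovered_nmol = nmol_from_dpm(770.0, 7700.0)
recovered_pct = percent_of_added(770.0, 0.4, 7700.0)
print(f"{recovered_nmol:.2f} nmol, {recovered_pct:.0f}% of added substrate")
```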

FIG. 2:

PDAT activity in yeast microsomes, as visualized by autoradiogram of neutral
lipid products separated on TLC. Microsomal membranes (10 nmol of PC) from
the wild type yeast strain FY1679 (lanes 1-3), a congenic yeast strain
(FVKT004-04C(AL)) that is disrupted for YNR008w (lane 4) or the same
disruption strain transformed with the plasmid pUS1, containing the YNR008w
gene behind its native promoter (lane 5), were assayed for PDAT activity. As
substrates, we used 2 nmol sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC together with
either 5 nmol of dioleoyl-DAG (lanes 2, 4 and 5) or rac-oleoyl-vernoloyl-DAG
(lane 3). The enzymatic assay and lipid analysis were performed as described in
Materials and Methods. The cells were precultured for 20 h in liquid YPD
medium, harvested and re-suspended in an equal volume of minimal medium
(19) containing 16 g/l glycerol. The cells were then grown for an additional 24 h
prior to being harvested. Selection for the plasmid was maintained by growing
the transformed cells in synthetic medium lacking uracil (18). Abbreviations:
1-OH-TAG, monoricinoleoyl-TAG; 1-OH-1-ep-TAG,
monoricinoleoyl-monovernoloyl-TAG; OH-FA, unesterified ricinoleic acid.

FIG. 3:

Lipid content (A, B) and PDAT activity (C) in PDAT overexpressing yeast cells.
The PDAT gene in the plasmid pUS4 was overexpressed from the
galactose-induced GAL1-TPK2 promoter in the wild type strain W303-1A (7). Its
expression was induced after (A) 2 hours or (B) 25 hours of growth by the
addition of galactose to a final concentration of 2% (w/v). The cells were then
incubated for another 22 hours before being harvested. The amount of lipids of
the harvested cells was determined by GLC analysis of their fatty acid contents
and is presented as µmol fatty acids per mg dry weight in TAG (open
bar), polar lipids (hatched bar), sterol esters (solid bar) and other lipids (striped
bar). The data shown are the mean values of results with three independent
yeast cultures. (C) In vitro synthesis of TAG by microsomes prepared from
yeast cells containing either the empty vector (vector) or the PDAT plasmid
(+ PDAT). The cells were grown as in Fig. 3A. The substrate lipids dioleoyl-DAG
(2.5 nmol) and sn-1-oleoyl-sn-2-[14C]oleoyl-PC (2 nmol) were added to aliquots
of microsomes (10 nmol PC), which were then incubated for 10 min at 28 °C.
The amount of label incorporated into TAG was quantified by electronic
autoradiography. The results shown are the mean values of two experiments.

FIG. 4:

Substrate specificity of yeast PDAT. The PDAT activity was assayed by
incubating aliquots of lyophilized microsomes (10 nmol PC) with substrate lipids
at 30°C for 10 min (panel A) or 90 min (panel B). Unlabeled DAG (2.5 nmol)
was used as substrate together with different labeled phospholipids, as shown
in the figure. (A) Sn-position specificity of yeast PDAT regarding the acyl donor
substrate. Dioleoyl-DAG together with either
sn-1-[14C]oleoyl-sn-2-[14C]oleoyl-PC (di-[14C]-PC),
sn-1-[14C]oleoyl-sn-2-oleoyl-PC (sn1-[14C]-PC) or
sn-1-oleoyl-sn-2-[14C]oleoyl-PC (sn2-[14C]-PC). (B) Specificity of yeast PDAT regarding
the phospholipid headgroup and the acyl composition of the phospholipid as well
as of the diacylglycerol. Dioleoyl-DAG together with either
sn-1-oleoyl-sn-2-[14C]oleoyl-PC (oleoyl-PC),
sn-1-oleoyl-sn-2-[14C]oleoyl-PE (oleoyl-PE),
sn-1-oleoyl-sn-2-[14C]ricinoleoyl-PC (ricinoleoyl-PC) or
sn-1-oleoyl-sn-2-[14C]vernoloyl-PC (vernoloyl-PC). In the experiments presented in the two bars to
the far right, monoricinoleoyl-DAG (ricinoleoyl-DAG) or monovernoloyl-DAG
(vernoloyl-DAG) was used together with sn-1-oleoyl-sn-2-[14C]oleoyl-PC. The
label that was incorporated into TAG (solid bars) and lyso-PC (LPC, open bars)
was quantified by electronic autoradiography. The results shown are the mean
values of two experiments. The microsomes used were from W303-1A cells
overexpressing the PDAT gene from the GAL1-TPK2 promoter, as described in
Fig. 3. The expression was induced at early stationary phase and the cells were
harvested after an additional 24 h.

TAB. 1:

In vitro synthesis of triacylglycerols in microsomal preparations of developing
castor bean. Aliquots of microsomes (20 nmol PC) were lyophilised and
substrate lipids were added in benzene solution: (A) 0.4 nmol [14C]-DAG (7760
dpm/nmol) and, where indicated, 1.6 nmol unlabelled DAG; (B) 0.4 nmol
[14C]-DAG (7760 dpm/nmol) and 5 nmol unlabelled di-ricinoleoyl-PC; and (C) 0.25
nmol [14C]-PC (4000 dpm/nmol) and 5 nmol unlabelled DAG. The benzene was
evaporated under N2 and 0.1 ml of 50 mM potassium phosphate was added,
thoroughly mixed and incubated at 30 °C for (A) 20 min or (B) and (C) 30 min.
Assays were terminated by extraction of the lipids in chloroform. The lipids
were then separated by thin layer chromatography on silica gel 60 plates
(Merck; Darmstadt, Germany) in hexane/diethylether/acetic acid (35:70:1.5). The
radioactive lipids were visualised and the radioactivity quantified on the plate by
electronic autoradiography (Instant Imager, Packard, US). Results are
presented as mean values of two experiments.

Radioactivity in different triacylglycerol (TAG) species formed. Abbreviations
used: 1-OH-, mono-ricinoleoyl-; 2-OH-, di-ricinoleoyl-; 3-OH-, tri-ricinoleoyl-;
1-OH-1-ver-, mono-ricinoleoyl-monovernoloyl-; 1-OH-2-ver-,
mono-ricinoleoyl-divernoloyl-. Radiolabelled DAG and PC were prepared enzymatically. The
radiolabelled ricinoleoyl group is attached at the sn-2 position of the lipid and
an unlabelled oleoyl group at the sn-1 position. Unlabelled DAG with vernoloyl or
ricinoleoyl chains was prepared by the action of TAG lipase (6) on oil of
Euphorbia lagascae or castor bean, respectively. Synthetic di-ricinoleoyl-PC
was kindly provided by Metapontum Agribios (Italy).

TAB. 2:

Total fatty acids per mg of T2 seeds pooled from individual Arabidopsis thaliana
plants transformed with the yeast PDAT gene under the control of the napin promoter
(26-14) or transformed with empty vector (32-4).

* = statistical difference between control plants and PDAT transformed plants
in a mean difference two-sided test at α = 5.
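
The text does not state which mean difference test was used. As a hedged illustration only, the sketch below computes Welch's unequal-variance t statistic and its approximate degrees of freedom from summary data of the kind listed in Tab. 2 (mean, standard deviation, n). The choice of Welch's test, the pairing of one PDAT plant with one control plant, and the function name are assumptions made for the example, not the authors' procedure.

```python
import math


def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Return (t, df) for Welch's unequal-variance t-test from summary data."""
    se1_sq = sd1 ** 2 / n1  # squared standard error of group 1
    se2_sq = sd2 ** 2 / n2  # squared standard error of group 2
    t = (mean1 - mean2) / math.sqrt(se1_sq + se2_sq)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se1_sq + se2_sq) ** 2 / (se1_sq ** 2 / (n1 - 1) + se2_sq ** 2 / (n2 - 1))
    return t, df


# Example values from Tab. 2: PDAT plant 26-14 no. 2 (1617 ± 109, n=4)
# against control plant 32-4 no. 6 (1312 ± 53, n=4).
t, df = welch_t(1617, 109, 4, 1312, 53, 4)
print(f"t = {t:.2f}, df = {df:.2f}")
```

A two-sided p-value would then be read from the t distribution with the computed degrees of freedom and compared with the chosen significance level.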
Description of the SEQ ID:

SEQ ID NO. 1: Genomic DNA sequence and suggested amino acid sequence of
the Saccharomyces cerevisiae PDAT gene, YNR008w, with GenBank accession
number Z71623 and Y13139, and with nucleotide ID number 1302481.

SEQ ID NO. 2: The amino acid sequence of the suggested open reading frame
YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 3: Genomic DNA sequence of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 4: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AB006704.

SEQ ID NO. 5: Nucleotide sequence of the Arabidopsis thaliana cDNA clone with
GenBank accession number T04806, and nucleotide ID number 315966.

SEQ ID NO. 6: Predicted amino acid sequence of the Arabidopsis thaliana cDNA
clone with GenBank accession number T04806.

SEQ ID NO. 7: Nucleotide and amino acid sequence of the Zea mays EST clone
with GenBank accession number AI491339, and nucleotide ID number 4388167.

SEQ ID NO. 8: Predicted amino acid sequence of the Zea mays EST clone with
GenBank accession number AI491339, and nucleotide ID number 4388167.

SEQ ID NO. 9: DNA sequence of part of the Neurospora crassa EST clone
W07G1, with GenBank accession number AI398644, and nucleotide ID number
4241729.
SEQ ID NO. 10: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AC004557.

SEQ ID NO. 11: Genomic DNA sequence of part of the Arabidopsis thaliana locus
with GenBank accession number AC003027.

SEQ ID NO. 12: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.

SEQ ID NO. 13: Amino acid sequence of the Schizosaccharomyces pombe
putative open reading frame CAA22887 of the Schizosaccharomyces pombe gene
SPBC776.14.

SEQ ID NO. 14: Amino acid sequence of the Arabidopsis thaliana putative open
reading frame AAC80628 derived from the Arabidopsis thaliana locus with
GenBank accession number AC004557.

SEQ ID NO. 15: Amino acid sequence of the Arabidopsis thaliana putative open
reading frame AAD10668 derived from the Arabidopsis thaliana locus with
GenBank accession number AC003027.

Further provisional and/or partial sequences are defined through the following
SEQ IDs:

SEQ ID NO. 1a: The amino acid sequence of the yeast ORF YNR008w from
Saccharomyces cerevisiae.

SEQ ID NO. 2a: Amino acid sequence of the region of the Arabidopsis thaliana
genomic sequence (AC004557).

SEQ ID NO. 3a: Amino acid sequence of the region of the Arabidopsis thaliana
genomic sequence (AB006704).

SEQ ID NO. 4a: The corresponding genomic DNA sequence and amino acid
sequence of the yeast ORF YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 5a: The amino acid sequence of the yeast ORF YNR008w from
Saccharomyces cerevisiae derived from the corresponding genomic DNA
sequence.

SEQ ID NO. 1b: Genomic DNA sequence of the Saccharomyces cerevisiae
PDAT gene, YNR008w, GenBank nucleotide ID number 1302481, and the
suggested YNR008w amino acid sequence.

SEQ ID NO. 2b: The suggested amino acid sequence of the yeast gene
YNR008w from Saccharomyces cerevisiae.

SEQ ID NO. 3b: Genomic DNA sequence of the Schizosaccharomyces pombe
gene SPBC776.14.

SEQ ID NO. 4b: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AB006704.

SEQ ID NO. 5b: Nucleotide sequence and the corresponding amino acid
sequence of the Arabidopsis thaliana EST clone with GenBank accession
number T04806, and ID number 315966.

SEQ ID NO. 6b: Nucleotide and amino acid sequence of the Zea mays cDNA
clone with GenBank ID number 4388167.
SEQ ID NO. 7b: Amino acid sequence of the Zea mays cDNA clone with
GenBank ID number 4388167.

SEQ ID NO. 8b: DNA sequence of part of the Neurospora crassa cDNA clone
W07G1, ID number 4241729.

SEQ ID NO. 9b: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC004557.

SEQ ID NO. 10b: Genomic DNA sequence of part of the Arabidopsis thaliana
locus with GenBank accession number AC003027.

SEQ ID NO. 11b: DNA sequence of part of the Lycopersicon esculentum cDNA
clone with GenBank accession number AI486635.

Claims

1. An enzyme catalysing in an acyl-CoA-independent reaction the transfer of
fatty acids from phospholipids to diacylglycerol in the biosynthetic pathway
for the production of triacylglycerol.

2. An enzyme according to claim 1, comprising an amino acid sequence as
set forth in SEQ ID No. 2 or a functional fragment, derivate, allele, homolog
or isoenzyme thereof.

3. An enzyme according to claims 1 or 2 designated as
phospholipid:diacylglycerol acyltransferase (PDAT).

4. An enzyme according to claims 1 to 3, comprising an amino acid sequence
as set forth in SEQ ID No. 1a, 2b or 5a or a functional fragment, derivate,
allele, homolog or isoenzyme thereof.

5. An enzyme according to claims 1 to 4, comprising an amino acid sequence
selected from the group consisting of sequences as set forth in SEQ ID No.
2a, 3a, 5b, 6, 7b, 8, 13, 14 or 15 or a functional fragment, derivate, allele,
homolog or isoenzyme thereof.

6. An enzyme according to claims 1 to 5, comprising an amino acid sequence
encoded through a nucleotide sequence, a portion, derivate, allele or
homolog thereof selected from the group consisting of sequences as set
forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b,
11, 11b or 12 or a functional fragment, derivate, allele, homolog or
isoenzyme of the enzyme encoding amino acid sequence.

7. A nucleotide sequence encoding an enzyme catalysing in an
acyl-CoA-independent reaction the transfer of fatty acids from phospholipids to
diacylglycerol in the biosynthetic pathway for the production of
triacylglycerol.

8. A nucleotide sequence according to claim 7 encoding an enzyme
designated as phospholipid:diacylglycerol acyltransferase (PDAT).

9. A nucleotide sequence according to claims 7 or 8, selected from the group
consisting of sequences as set forth in SEQ ID No. 1, 1b, 3, 3b, 4, 4a, 4b,
9b, 10, 10b or 11 or a portion, derivate, allele or homolog thereof.

10. A partial nucleotide sequence corresponding to a full-length nucleotide
sequence according to claims 7 to 9, selected from the group consisting of
sequences as set forth in SEQ ID No. 5, 5b, 6b, 7, 8b, 9, 11b or 12 or a
portion, derivate, allele or homolog thereof.

11. A nucleotide sequence according to claims 7 to 10, comprising a nucleotide
sequence which is at least 40% homologous to a nucleotide sequence
selected from the group consisting of those sequences set forth in SEQ ID
No. 1, 1b, 3, 3b, 4, 4a, 4b, 5, 5b, 6b, 7, 8b, 9, 9b, 10, 10b, 11, 11b or 12.

12. A gene construct comprising a nucleotide sequence according to claims 7
to 11 operably linked to a heterologous nucleic acid.

13. A vector comprising a nucleotide sequence according to claims 7 to 11 or a
gene construct according to claim 12.

14. A vector according to claim 13, which is an expression vector.

15. A vector according to claims 13 or 14, further comprising a selectable
marker gene and/or nucleotide sequences for the replication in a host cell
or the integration into the genome of the host cell.
16. A transgenic cell or organism containing a nucleotide sequence according
to claims 7 to 11 and/or a gene construct according to claim 12 and/or a
vector according to claims 13 to 15.

17. A transgenic cell or organism according to claim 16 which is an eucaryotic
cell or organism.

18. A transgenic cell or organism according to claims 16 or 17 which is a yeast
cell or a plant cell or a plant.

19. A transgenic cell or organism according to claims 16 to 18 having an
altered biosynthetic pathway for the production of triacylglycerol.

20. A transgenic cell or organism according to claims 16 to 19 having an
altered oil content.

21. A transgenic cell or organism according to claims 16 to 20 wherein the
activity of PDAT is altered.

22. A transgenic cell or organism according to claims 16 to 21 wherein the
altered activity of PDAT is characterized by an alteration in gene
expression, catalytic activity and/or regulation of activity of the enzyme.

23. A transgenic cell or organism according to claims 16 to 22 wherein the
altered biosynthetic pathway for the production of triacylglycerol is
characterized by the prevention of accumulation of undesirable fatty acids
in the membrane lipids.

24. A process for the production of triacylglycerol, comprising growing a
transgenic cell or organism according to claims 16 to 23 under conditions
whereby the said nucleotide sequence according to claims 7 to 11 is
expressed.

25. Triacylglycerols produced by a process according to claim 24.

26. Use of a nucleotide sequence according to claims 7 to 11 and/or an
enzyme according to claims 1 to 6 for the production of triacylglycerol
and/or triacylglycerols with uncommon fatty acids.

27. Use of a nucleotide sequence according to claims 7 to 11 and/or an
enzyme according to claims 1 to 6 for the transformation of any cell or
organism in order to be expressed in this cell or organism and result in an
altered, preferably increased oil content of this cell or organism.

Fig. 1:

Tab. 2:

T1 plant   T2 plant number   nmol fatty acids per mg seed   standard deviation
32-4       1                 1277                           ±11 (n=2)
           4                 1261                           ±63 (n=3)
           5                 1369                           ±17 (n=3)
           6                 1312                           ±53 (n=4)
           7                 1197                           ±54 (n=5)
           8                 1240                           ±78 (n=4)
           9                 1283                           ±54 (n=5)
           10                1381                           ±35 (n=5)
26-14      1                 1444                           ±110 (n=4)
           2                 1617*                          ±109 (n=4)
           3                 1374                           ±37 (n=2)
           5                 1562*                          ±70 (n=4)
           6                 1393                           ±77 (n=4)
           7                 1433                           ±98 (n=4)
           8                 1581*                          ±82 (n=4)
Sequence Listing

<210> 1
<211> 1986
<212> genomic DNA
<213> Saccharomyces cerevisiae

<221> CDS
<222> (1)..(1983)

<400> 1

atg ggc aca ctg ttt cga aga aat gtc cag aac caa aag agt gat tcn 48
Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

gat gaa aac aat aaa ggg ggt tct gtt cat aac aag cga gag agc aga 96
Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

aac cac att cat cat caa cag gga tta ggc cat aag aga aga agg ggt 144
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

att agt ggc agt gca aaa aga aat gag cgt ggc aaa gat ttc gac agg 192
Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

aaa aga gac ggg aac ggc aga aaa cgt tgg aga gat tcc aga aga ctg 240
Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

atc ttc att cct ggt gca ttc tta ggt gta ctt ttg ccg ttt agc ttt 288
Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

ggc gct tat cat gtt cat aat agc gat agc gac ttg ttt gac aac ttt 336
Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

gta aat ttt gat tca ctt aaa gtg tat ttg gat gat tgg aaa gat gtt 384
Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

ctc cca caa ggc ata agt tcg ttt att gat gat att cag gct ggt aac 432
Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

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tac tec aca tct tct tta gat gat etc age gaa aat: ttt gcc.gtt ggt 480 
Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 160 

aaa caa etc tta cgt gat tat aat ate gag gec aaa cat cct get gta 528 
Lys Gin Leu Leu Arg Asp Tyr Asn lie Glu Ala Lys His Pro Val Val 
165 170 175 

atg gtt cct ggt gtc att.tct acg gga att gaa age tgg gga gtt att 57 6 
Met Val Pro Gly Val He Ser Thr Gly He Glu Ser Trp Gly Val He 
180 185 190 

gga gac gat gag tgc gat agt tct gcg cat ttt cgt aaa egg ctg tgg 624 - 
Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 205 

gga agt ttt tac atg ctg aga aca atg gtt atg gat aaa gtt tgt tgg 672 
Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp 
210 215 220 

ttg aaa cat gta atg tta gat cct gaa aca ggt ctg gac cca ccg aac 720 
Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn 
225 230 235 . 240 

ttt acg eta cgt gca gca cag ggc ttc gaa tea act gat tat ttc ate 768 
Phe Thr Leu Arg Ala Ala Gin Gly Phe Glu Ser Thr Asp Tyr Phe He 
245 250 255 

gca ggg tat tgg att tgg aac aaa gtt ttc caa aat ctg gga gta att 816 
Ala Gly Tyr Trp He Trp Asn Lys Val Phe Gin Asn Leu Gly Val He 
260 265 -270 

ggc tat gaa ccc aat aaa atg acg agt get gcg tat gat tgg agg ctt 864 
Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 280 285 

gca tat tta gat eta gaa aga cgc gat agg tac ttt acg aag eta aag 912 
Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 

gaa caa ate gaa ctg ttt cat caa ttg agt ggt gaa aaa gtt tgt tta 960 
Glu Gin He Glu Leu Phe His Gin Leu Ser Gly Glu Lys Val Cys Leu 
305 310 315 320 

att gga cat tct atg ggt tct cag att ate ttt tac ttt atg aaa tgg 1008 
He Gly His Ser Met Gly Ser Gin He lie Phe Tyr Phe Met Lys Trp 
325 330 ' 335 
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gtc gag get gaa ggc cct ctt tac ggt aat ggt g£t cgt ggc egg gtt 1056 
Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val 
340 345 350 

aac gaa cac ata gat tea ttc att aat gca gca.ggg acg ctt ctg ggc 1104 
Asn Glu His He Asp Ser Phe He Asn Ala Ala Gly Thr Leu Leu Gly 
355 360- 365 

get cca aag gca gtt cca get eta att agt ggt gaa atg aaa gat ace 1152 
Ala Pro Lys Ala Val Pro Ala Leu He Ser Gly Glu Met Lys Asp Thr 
370 375 380 

att caa tta aat acg tta gee atg tat ggt ttg gaa aag ttc ttc tea 1200 
tie Gin Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser 
385 390 395 400 

aga att gag aga gta aaa atg tta caa acg tgg ggt ggt ata cca tea 1248 
Arg He Glu Arg Val Lys Met Leu Gin Thr Trp Gly Gly He Pro Ser 
405 410 415 

atg eta cca aag gga gaa gag gtc att tgg ggg gat atg aag tea tct 1296 
Met Leu Pro Lys Gly Glu Glu Val He Trp Gly Asp Met Lys Ser Ser : 
420 425 430 

tea gag gat gca ttg aat aac aac act gac aca tac ggc aat ttc att 1344 
Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe He , . 
435 440 445 

cga ttt gaa agg aat acg age gat get ttc aac aaa aat ttg aca atg 1392 
Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
.450 455 460 

aaa gac gec att aac atg aca tta teg ata tea cct gaa tgg etc caa 1440 
Lys Asp Ala He Asn Met Thr Leu Ser He Ser Pro Glu Trp Leu Gin 
465 470 475 480 

aga aga gta cat gag cag tac teg ttc ggc tat tec aag aat gaa gaa 1488 
Arg Arg Val His Glu Gin Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 

gag tta aga aaa aat gag eta cac cac aag cac tgg teg aat cca atg 1536 
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 . 505 510 

gaa gta cca ctt cca gaa get ccc cac atg aaa ate tat tgt ata tac 1584 
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys He Tyr Cys He Tyr 
515 520 525 
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ggg gtg aac aac cca act gaa agg gca tac gta cat aag gaa gag gat 1632' 

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 

530 535 540 



gac tec tct get ctg aat ttg acc ate gac tac gaa age aag caa cct 
Asp Ser Ser Ala Leu Asn Leu Thr He Asp Tyr Glu Ser Lys Gin Pro 
545 550 555 560 



gcg gag ttg aac gat tac ate ttg aaa att gca age ggt aat ggc gat 
Ala Glu Leu Asn Asp Tyr lie Leu Lys He Ala Ser Gly Asn Gly Asp 
625 630 635 640 

etc gtc gag cca cgc caa ttg tct aat ttg age cag tgg gtt tct cag. 
Leu Val Glu Pro Arg Gin Leu Ser Asn Leu Ser Gin Trp Val Ser Gin 
645 650 655 

atg ccc ttc cca atg taa 
Met Pro Phe Pro Met 
660 



1680 



gta ttc. etc acc gag ggg gac gga acc gtt ccg etc gtg gcg cat tea 1728 
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

atg tgt cac aaa tgg gee cag ggt get tea ccg tac aac cct gec gga 1776 
Met Cys His Lys Trp Ala Gin Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 

att aac gtt act att gtg gaa atg aaa cac cag. cca gat cga ttt gat 1824 
He Asn Val Thr He Val Glu Met Lys His Gin Pro Asp Arg Phe Asp 
595 600 605 

ata cgt ggt gga gca aaa age gee gaa cac gta gac ate etc ggc age 1872 
He Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp He Leu Gly Ser 
610^ ' 615 620 



1920 



1968 



1986 
<210> 2
<211> 661
<212> PRT

<213> Saccharomyces cerevisiae
<400> 2

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser
1 5 10 15

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg
20 25 30

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly
35 40 45

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg
50 55 60

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu
65 70 75 80

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe
85 90 95

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe
100 105 110

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val
115 120 125

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn
130 135 140

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly
145 150 155 160

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val
165 170 175

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile
180 185 190

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp
195 200 205

Gly Ser Phe Tyr Met Leu Arg Thr Met Val Met Asp Lys Val Cys Trp
210 215 220

Leu Lys His Val Met Leu Asp Pro Glu Thr Gly Leu Asp Pro Pro Asn
225 230 235 240

Phe Thr Leu Arg Ala Ala Gln Gly Phe Glu Ser Thr Asp Tyr Phe Ile
245 250 255

Ala Gly Tyr Trp Ile Trp Asn Lys Val Phe Gln Asn Leu Gly Val Ile
260 265 270

Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu
275 280 285

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys
290 295 300

Glu Gln Ile Glu Leu Phe His Gln Leu Ser Gly Glu Lys Val Cys Leu
305 310 315 320

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp
325 330 335

Val Glu Ala Glu Gly Pro Leu Tyr Gly Asn Gly Gly Arg Gly Trp Val
340 345 350

Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly
355 360 365

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr
370 375 380

Ile Gln Leu Asn Thr Leu Ala Met Tyr Gly Leu Glu Lys Phe Phe Ser
385 390 395 400

Arg Ile Glu Arg Val Lys Met Leu Gln Thr Trp Gly Gly Ile Pro Ser
405 410 415

Met Leu Pro Lys Gly Glu Glu Val Ile Trp Gly Asp Met Lys Ser Ser
420 425 430

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile
435 440 445

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met
450 455 460

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln
465 470 475 480

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu
485 490 495

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met
500 505 510

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr
515 520 525

Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp
530 535 540

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro
545 550 555 560

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser
565 570 575

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly
580 585 590

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp
595 600 605

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser
610 615 620

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp
625 630 635 640

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln
645 650 655

Met Pro Phe Pro Met
660
<210> 3

<211> 2312 

<212> genomic DNA 

<213> Schizosaccharomyces pombe

<400> 3 

ATGGCGTCTT CCAAGAAGAG CAAAACTCAT AAGAAAAAGA AAGAAGTCAA 50 

ATCTCCTATC GACTTACCAA ATTCAAAGAA ACCAACTCGC GCTTTGAGTG 100 

AGCAACCTTC AGCGTCCGAA ACACAATCTG TTTCAAATAA ATCAAGAAAA 150 

TCTAAATTTG GAAAAAGATT GAATTTTATA TTGGGCGCTA TTTTGGGAAT 200 

ATGCGGTGCT TTTTTTTTCG CTGTTGGAGA CGACAATGCT GTTTTCGACC 250 
CTGCTACGTT AGATAAATTT GGGAATATGC TAGGCTCTTC AGACTTGTTT 300

GATGACATTA AAGGATATTT ATCTTATAAT GTGTTTAAGG ATGCACCTTT 350

TACTACGGAC AAGCCTTCGC AGTCTCCTAG CGGAAATGAA GTTCAAGTTG 400 

GTCTTGATAT GTACAATGAG GGATATCGAA GTGACCATCC TGTTATTATG 450 

GTTCCTGGTG TTATCAGCTC AGGATTAGAA AGTTGGTCGT TTAATAATTG 500 

CTCGATTCCT TACTTTAGGA AACGTGTTTG GGGTAGCTGG TCTATGCTGA 550 

AGGCAATGTT CCTTGACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT 600 

AAAAAAACCG GCTTGGATCC GAAGGGAATT AAGCTGCGAG CAGCTCAGGG 650 

GTTTGAAGCA GCTGATTTTT TTATCACGGG CTATTGGATT TGGAGTAAAG 700 

TAATTGAAAA CCTTGCTGCA ATTGGTTATG AGCCTAATAA CATGTTAAGT 750 

GCTTCTTACG ATTGGCGGTT ATCATATGCA AATTTAGAGG AACGTGATAA 800 

ATATTTTTCA AAGTTAAAAA TGTTCATTGA GTACAGCAAC ATTGTACATA 850 

AGAAAAAGGT AGTGTTGATT TCTCACTCCA TGGGTTCACA GGTTACGTAC 900 

TATTTTTTTA AGTGGGTTGA AGCTGAGGGC TACGGAAATG GTGGACCGAC 950 

TTGGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 1000 
GTTTGACTAC GTTTCTAACT TTTGAATAGA TATCGGGATC TTTGATTGGA 1050 

GCACCCAAAA CAGTGGCAGC GCTTTTATCG GGTGAAATGA AAGATACAGG 1100 

TATTGTAATT ACATTAAACA TGTTAATATT TAATTTTTGC TAACCGTTTT 1150 

AAGCTCAATT GAATCAGTTT TCGGTCTATG GGTAAGCAAT AAATTGTTGA 1200 

GATTTGTTAC TAATTTACTG TTTAGTTTGG AAAAATTTTT TTCCCGTTCT 1250 

GAGGTATATT CAAAAATACA AATGTGCTCT ACTTTTTCTA ACTTTTAATA 1300 

GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCCTA 1350 

AAGGAGGCGA TGTTGTATGG GGAAATGCCA GTTGGGTAAG AAATATGTGC 1400 

TGTTAATTTT TTATTAATAT TTAGGCTCCA GATGATCTTA ATCAAACAAA 1450 

TTTTTCCAAT GGTGCAATTA TTCGATATAG AGAAGACATT GATAAGGACC 1500 

ACGATGAATT TGACATAGAT GATGCATTAC AATTTTTAAA AAATGTTACA 1550 

GATGACGATT TTAAAGTCAT GCTAGCGAAA AATTATTCCC ACGGTCTTGC 1600 
TTGGACTGAA AAAGAAGTGT TAAAAAATAA CGAAATGCCG TCTAAATGGA 1650 

TAAATCCGCT AGAAGTAAGA ACATTAAAGT TACTAAATTA TACTAACCCA 1700 

AATAGACTAG TCTTCCTTAT GCTCCTGATA TGAAAATTTA TTGCGTTCAC 1750

GGGGTCGGAA AACCAACTGA GAGAGGTTAT TATTATACTA ATAATCCTGA 1800 

GGGGCAACCT GTCATTGATT CCTCGGTTAA TGATGGAACA AAAGTTGAAA 1850 

ATGTGAGAGA ATTTATGTTT CAAACATTCT ATTAACTGTT TTATTAGGGT 1900 

ATTGTTATGG ATGATGGTGA TGGAACTTTA CCAATATTAG CCCTTGGTTT 1950

GGTGTGCAAT AAAGTTTGGC AAACAAAAAG GTTTAATCCT GCTAATACAA 2000 

GTATCACAAA TTATGAAATC AAGCATGAAC CTGCTGCGTT TGATCTGAGA 2050 

GGAGGACCTC GCTCGGCAGA ACACGTCGAT ATACTTGGAC ATTCAGAGCT 2100 

AAATGTATGT TCATTTTACC TTACAAATTT CTATTACTAA CTCTTGAAAT 2150 
AAGGAAATTA TTTTAAAAGT TTCATCAGGC CATGGTGACT CGGTACCAAA 2200 

CCGTTATATA TCAGATATCC AGTACGGACA TAAGTTTTGT AGATTGCAAT 2250

TAACTAACTA ACCGAACAGG GAAATAATAA ATGAGATAAA TCTCGATAAA 2300 
CCTAGAAATT AA 2312 
<210> 4

<211> 3685

<212> genomic DNA

<213> Arabidopsis thaliana

<400> 4 

ATGCCCCTTA TTCATCGGAA AAAGCCGACG GAGAAACCAT CGACGCCGCC 
ATCTGAAGAG GTGGTGCACG ATGAGGATTC GCAAAAGAAA CCACACGAAT 
CTTCCAAATC CCACCATAAG AAATCGAACG GAGGAGGGAA GTGGTCGTGC 
ATCGATTCTT GTTGTTGGTT CATTGGGTGT GTGTGTGTAA CCTGGTGGTT 
TCTTCTCTTC CTTTACAACG CAATGCCTGC GAGCTTCCCT CAGTATGTAA 
CGGAGCGAAT CACGGGTCCT TTGCCTGACC CGCCCGGTGT TAAGCTCAAA 
AAAGAAGGTC TTAAGGCGAA ACATCCTGTT GTCTTCATTC CTGGGATTGT 
CACCGGTGGG CTCGAGCTTT GGGAAGGCAA ACAATGCGCT GATGGTTTAT 
TTAGAAAACG TTTGTGGGGT GGAACTTTTG GTGAAGTCTA CAAAAGGTGA 
GCTCAACAAT TCTCACTCTT CCTTTATATT GGGATTTGGA TTGGATCTGA 
TGAGATCACG CACTTGTTGC TTCTTCAACA TCACTCAAAC TTTAATTCCA 
TGTTTGTCTG TCTTACTCTT TACTTTTTTT TTTTTTTGAT GTGAAACGCT 
ATTTTCTTAA GAGACTATTT CTGTATGTGT AAGGTAAGCG TTCCAAGGAC 
GTAATTGGCT TGGACTATTT CTGTTTGATT GTTAACTTTA GGATATAAAA 
TAGCTGCCTT GGAATTTCAA GTCATCTTAT TGCCAAATCT GTTGCTAGAC 
ATGCCCTAGA GTCCGTTCAT AACAAGTTAC TTCCTTTACT GTCGTTGCGT 
GTAGATTTAG CTTTGTGTAG CGTATAATGA AGTAGTGTTT TATGTTTTGT 
TGGGAATAGA GAAGTTCTAA CTACATCTGT GGAAAGTGTG TTCAGGCTGT 
GATAGAGGAC TGTTGCTTTA TTATTCAACT ATGTATATGT GTAATTAAAG 
CTAGTTCCTT TTTGATCTTT CAGCTCAATG TGCTTTTCTC AATTTTTTTC 
TCAATTTCAA AGTTTCACAT CGAGTTTATT CACATGTCTT GAATTTCGTC 
CATCCTCGTT CTGTTATGCA GCTTTGAACT CCTCCCGACC CTGCTATGGA 
TATATTAAAA AAAAAGTGTT TTGTGGGTTG CATCTTTGTT ACGATCTGCA 
TCTTCTTCTT TCGGCTCAGT GTTCATGTTT TTGCTATGGT AGAGATGGGC 
AATGTTATTG TTGATGGTAA CAGTGGTATA GTTGATAGTA TCTTAACTAA 
TCAATTATCT CTTTGATTCA GGCCTCTATG TTGGGTGGAA CACATGTCAC
TTGACAATGA AACTGGGTTG GATCCAGCTG GTATTAGAGT TCGAGCTGTA
TCAGGACTCG TGGCTGCTGA CTACTTTGCT CCTGGCTACT TTGTCTGGGC 
AGTGCTGATT GCTAACCTTG CACATATTGG ATATGAAGAG AAAAATATGT 
ACATGGCTGC ATATGACTGG CGGCTTTCGT TTCAGAACAC AGAGGTTCTT 
TTCTCATCGT TCTTTCTATT ATTCTGTTCC ATGTTACGTT TCTTTCTTCA 
TTACTTAAGG CTTAAATATG TTTCATGTTG AATTAATAGG TACGTGATCA 
GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG 
GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT 
CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG 
TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG 
GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA 
GCAAAGGATG TTGCAGTTGC CAGGTATTGA ATATCTGCTT ATACTTTTGA 
TGATCAGAAC CTTGGCTCTG GAACTCAAAG TTATTCTACT AAATATCAAT 
TCTAATAACA TTGCTATATT ATCGCTGCAA CTGACATTGG TTGATTATTT 
TTGCTGCTTA TGTAACTGAA ACTCTCTTGA GATTAGACAA ATGATGAATT 
GATAATTCTT ACGCATTGCT CTGTGATGAC CAGTTTCTTA GCTTCGACGA 
TAACATTTGT CATACTGTCT TTTGGAGGGC ATTGAATTTT GCTATGGAAA 
GCGCTGGAGC TTCCATGCTT GCATTCTTTA CCAATTAGCG TTATTCTGCT 
TCTTTCAATT TTCTTGTATA TGCATCTATG GTCTTTTATT TCTTCTTAAT 
TAAAGACTCG TTGGATTAGT TGCTCTATTA GTCACTTGGT TCCTTAATAT 
AGAACTTTAC TTTCTTCGAA AATTGCAGAG CGATTGCCCC AGGATTCTTA 
GACACCGATA TATTTAGACT TCAGACCTTG CAGCATGTAA TGAGAATGAC 
ACGCACATGG GACTCAACAA TGTCTATGTT ACCGAAGGGA GGTGACACGA 
TATGGGGCGG GCTTGATTGG TCACCGGAGA AAGGCCACAC CTGTTGTGGG 
AAAAAGCAAA AGAACAACGA AACTTGTGGT GAAGCAGGTG AAAACGGAGT 
TTCCAAGAAA AGTCCTGTTA ACTATGGAAG GATGATATCT TTTGGGAAAG 
AAGTAGCAGA GGCTGCGCCA TCTGAGATTA ATAATATTGA TTTTCGAGTA 
AGGACATATA AATCATAATA AACCTTGTAC ATTTTGTGAT TGTATGATGA 
ATATCTGTAC ATTTTATCTG GTGAAGGGTG CTGTCAAAGG TCAGAGTATC 
CCAAATCACA CCTGTCGTGA CGTGTGGACA GAGTACCATG ACATGGGAAT 
TGCTGGGATC AAAGCTATCG CTGAGTATAA GGTCTACACT GCTGGTGAAG
CTATAGATCT ACTACATTAT GTTGCTCCTA AGATGATGGC GCGTGGTGCC 2900
GCTCATTTCT CTTATGGAAT TGCTGATGAT TTGGATGAGA CCAAGTATCA 2950
AGATCCCAAA TACTGGTCAA ATCCGTTAGA GACAAAGTAA GTGATTTCTT 3000
GATTCCAACT GTATCCTTCG TCCTGATGCA TTATCAGTCT TTTTGTTTTC 3050
GGTCTTGTTG GATATGGTTT TCAGCTCAAA GCTTACAAAG CTGTTTCTGA 3100
GCCTTTCTCA AAAAGGCTTG CTCAGTAATA TTGAGGTGCT AAAGTTGATA 3150
CATGTGACTC TTGCTTATAA ATCCTCCGTT TGGTTTGTTC TGCTTTTTCA 3200
GATTACCGAA TGCTCCTGAG ATGGAAATCT ACTCATTATA CGGAGTGGGG 3250
ATACCAACGG AACGAGCATA CGTATACAAG CTTAACCAGT CTCCCGACAG 3300
TTGCATCCCC TTTCAGATAT TCACTTCTGC TCACGAGGAG GACGAAGATA 3350
GCTGTCTGAA AGCAGGAGTT TACAATGTGG ATGGGGATGA AACAGTACCC 3400
GTCCTAAGTG CCGGGTACAT GTGTGCAAAA GCGTGGCGTG GCAAGACAAG 3450
ATTCAACCCT TCCGGAATCA AGACTTATAT AAGAGAATAC AATCACTCTC 3500
CGCCGGCTAA CCTGTTGGAA GGGCGCGGGA CGCAGAGTGG TGCCCATGTT 3550
GATATCATGG GAAACTTTGC TTTGATCGAA GATATCATGA GGGTTGCCGC 3600
CGGAGGTAAC GGGTCTGATA TAGGACATGA CCAGGTCCAC TCTGGCATAT 3650
TTGAATGGTC GGAGCGTATT GACCTGAAGC TGTGA 3685
<210> 5
<211> 2427
<212> cDNA

<213> Arabidopsis thaliana

<400> 5
AGAAACAGCT CTTTGTCTCT CTCGACTGAT CTAACAATCC CTAATCTGTG
TTCTAAATTC CTGGACGAGA TTTGACAAAG TCCGTATAGC TTAACCTGGT
TTAATTTCAA GTGACAGATA TGCCCCTTAT TCATCGGAAA AAGCCGACGG
AGAAACCATC GACGCCGCCA TCTGAAGAGG TGGTGCACGA TGAGGATTCG
CAAAAGAAAC CACACGAATC TTCCAAATCC CACCATAAGA AATCGAACGG
AGGAGGGAAG TGGTCGTGCA TCGATTCTTG TTGTTGGTTC ATTGGGTGTG
TGTGTGTAAC CTGGTGGTTT CTTCTCTTCC TTTACAACGC AATGCCTGCG
AGCTTCCCTC AGTATGTAAC GGAGCGAATC ACGGGTCCTT TGCCTGACCC
GCCCGGTGTT AAGCTCAAAA AAAGAAGGTC TTAAGGCGAA ACATCCTGTT
GTCTTCATTC CTGGGATTGT CACCGGTGGG CTCGAGCTTT GGGAAGGCAA
ACAATGCGCT GATGGTTTAT TTAGAAAACG TTTGTGGGGT GGAACTTTTG
GTGAAGTCTA CAAAAGGCCT CTATGTTGGG TGGAACACAT GTCACTTGAC
AATGAAACTG GGTTGGATCC AGCTGGTATT AGAGTTCGAG CTGTATCAGG
ACTCGTGGCT GCTGACTACT TTGCTCCTGG CTACTTTGTC TGGGCAGTGC
TGATTGCTAA CCTTGCACAT ATTGGATATG AAGAGAAAAA TATGTACATG
GCTGCATATG ACTGGCGGCT TTCGTTTCAG AACACAGAGG TACGTGATCA
GACTCTTAGC CGTATGAAAA GTAATATAGA GTTGATGGTT TCTACCAACG
GTGGAAAAAA AGCAGTTATA GTTCCGCATT CCATGGGGGT CTTGTATTTT
CTACATTTTA TGAAGTGGGT TGAGGCACCA GCTCCTCTGG GTGGCGGGGG
TGGGCCAGAT TGGTGTGCAA AGTATATTAA GGCGGTGATG AACATTGGTG
GACCATTTCT TGGTGTTCCA AAAGCTGTTG CAGGGCTTTT CTCTGCTGAA
GCAAAGGATG TTGCAGTTGC CAGAGCGATT GCCCCAGGAT TCTTAGACAC
CGATATATTT AGACTTCAGA CCTTGCAGCA TGTAATGAGA ATGACACGCA
GATGGGACTC AACAATGTCT ATGTTACCGA AGGGAGGTGA CACGATATGG
GGCGGGCTTG ATTGGTCACC GGAGAAAGGC CACACCTGTT GTGGGAAAAA
GCAAAAGAAC AACGAAACTT GTGGTGAAGC AGGTGAAAAC GGAGTTTCCA
AGAAAAGTCC TGTTAACTAT GGAAGGATGA TATCTTTTGG GAAAGAAGTA
GCAGAGGCTG CGCCATCTGA GATTAATAAT ATTGATTTTC GAGGTGCTGT
CAAAGGTCAG AGTATCCCAA ATCACACCTG TCGTGACGTG TGGACAGAGT
ACCATGACAT GGGAATTGCT GGGATCAAAG CTATCGCTGA GTATAAGGTC
TACACTGCTG GTGAAGCTAT AGATCTACTA CATTATGTTG CTCCTAAGAT
GATGGCGCGT GGTGCCGCTC ATTTCTCTTA TGGAATTGCT GATGATTTGG
ATGACACCAA GTATCAAGAT CCCAAATACT GGTCAAATCC GTTAGAGACA
AAATTACCGA ATGCTCCTGA GATGGAAATC TACTCATTAT ACGGAGTGGG
GATACCAACG GAACGAGCAT ACGTATACAA GCTTAACCAG TCTCCCGACA
GTTGCATCCC CTTTCAGATA TTCACTTCTG CTCACGAGGA GGACGAAGAT
AGCTGTCTGA AAGCAGGAGT TTACAATGTG GATGGGGATG AAACAGTACC
CGTCCTAAGT GCCGGGTACA TGTGTGCAAA AGCGTGGCGT GGCAAGACAA
GATTCAACCC TTCCGGAATC AAGACTTATA TAAGAGAATA CAATCACTCT
CCGCCGGCTA ACCTGTTGGA AGGGCGCGGG ACGCAGAGTG GTGCCCATGT
TGATATCATG GGAAACTTTG CTTTGATCGA AGATATCATG AGGGTTGCCG
CCGGAGGTAA CGGGTCTGAT ATAGGACATG ACCAGGTCCA CTCTGGCATA
TTTGAATGGT CGGAGCGTAT TGACCTGAAG CTGTGAATAT CATGATCTCT
TTAAGCTGTC CTGTCAGCTT ATGTGAATCC AATACTTTGA AAGAGAGATC
ATCATCAATT CATCATCATC GTCATCATCA TGATGCTCAA CTCACAAAGA
AGCCTGAGAA TGATACTTTG GTGCGAAATT CTCAATACCT CTTTAATATT
CTTATTGAAT GTAAATTATA CAATCCTATC TAATGTTTGA ACGATAACAC
AAAACTTGCT GCNGCCATGT TTGTTTGTCT TGTCAAAAGC ATCAATTTGT
GGGTTAAAAA AAAAAAAAAA AAAAAAA 2427
<210> 6

<211> 671 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 6 

MPLIHRKKPT EKPSTPPSEE VVHDEDSQKK PHESSKSHHK KSNGGGKWSC 50

IDSCCWFIGC VCVTWWFLLF LYNAMPASFP QYVTERITGP LPDPPGVKLK 100

KEGLKAKHPV VFIPGIVTGG LELWEGKQCA DGLFRKRLWG GTFGEVYKRP 150 

LCWVEHMSLD NETGLDPAGI RVRAVSGLVA ADYFAPGYFV WAVLIANLAH 200 

IGYEEKNMYM AAYDWRLSFQ NTEVRDQTLS RMKSNIELMV STNGGKKAVI 250 

VPHSMGVLYF LHFMKWVEAP APLGGGGGPD WCAKYIKAVM NIGGPFLGVP 300 

KAVAGLFSAE AKDVAVARAI APGFLDTDIF RLQTLQHVMR MTRTWDSTMS 350 

MLPKGGDTIW GGLDWSPEKG HTCCGKKQKN NETCGEAGEN GVSKKSPVNY 400 

GRMISFGKEV AEAAPSEINN IDFRGAVKGQ SIPNHTCRDV WTEYHDMGIA 450 

GIKAIAEYKV YTAGEAIDLL HYVAPKMMAR GAAHFSYGIA DDLDDTKYQD 500 

PKYWSNPLET KLPNAPEMEI YSLYGVGIPT ERAYVYKLNQ SPDSCIPFQI 550 

FTSAHEEDED SCLKAGVYNV DGDETVPVLS AGYMCAKAWR GKTRFNPSGI 600 

KTYIREYNHS PPANLLEGRG TQSGAHVDIM GNFALIEDIM RVAAGGNGSD 650
IGHDQVHSGI FEWSERIDLK L 671 
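
The `<211>` field of each entry gives the declared sequence length, so a listing can be sanity-checked by counting the residues printed under `<400>`. The Python sketch below does this for SEQ ID NO: 6. It is only an illustration: the function name `residue_count` is hypothetical, and the third block of the first row and the first block of the second row are read as `VVHDEDSQKK` and `IDSCCWFIGC`, which is an assumption based on the ten-residue block width and on the coding sequence of SEQ ID NO: 4.

```python
# SEQ ID NO: 6, ten residues per block, fifty per full row.
# Assumption: two damaged blocks are read as VVHDEDSQKK and IDSCCWFIGC.
SEQ_ID_6 = """
MPLIHRKKPT EKPSTPPSEE VVHDEDSQKK PHESSKSHHK KSNGGGKWSC
IDSCCWFIGC VCVTWWFLLF LYNAMPASFP QYVTERITGP LPDPPGVKLK
KEGLKAKHPV VFIPGIVTGG LELWEGKQCA DGLFRKRLWG GTFGEVYKRP
LCWVEHMSLD NETGLDPAGI RVRAVSGLVA ADYFAPGYFV WAVLIANLAH
IGYEEKNMYM AAYDWRLSFQ NTEVRDQTLS RMKSNIELMV STNGGKKAVI
VPHSMGVLYF LHFMKWVEAP APLGGGGGPD WCAKYIKAVM NIGGPFLGVP
KAVAGLFSAE AKDVAVARAI APGFLDTDIF RLQTLQHVMR MTRTWDSTMS
MLPKGGDTIW GGLDWSPEKG HTCCGKKQKN NETCGEAGEN GVSKKSPVNY
GRMISFGKEV AEAAPSEINN IDFRGAVKGQ SIPNHTCRDV WTEYHDMGIA
GIKAIAEYKV YTAGEAIDLL HYVAPKMMAR GAAHFSYGIA DDLDDTKYQD
PKYWSNPLET KLPNAPEMEI YSLYGVGIPT ERAYVYKLNQ SPDSCIPFQI
FTSAHEEDED SCLKAGVYNV DGDETVPVLS AGYMCAKAWR GKTRFNPSGI
KTYIREYNHS PPANLLEGRG TQSGAHVDIM GNFALIEDIM RVAAGGNGSD
IGHDQVHSGI FEWSERIDLK L
"""

DECLARED_LENGTH = 671  # value of the <211> field for SEQ ID NO: 6


def residue_count(listing: str) -> int:
    """Count residues in a block-formatted listing, ignoring whitespace."""
    return sum(len(block) for block in listing.split())


if __name__ == "__main__":
    print(residue_count(SEQ_ID_6) == DECLARED_LENGTH)
```

The same count applies to the nucleotide entries, provided the running position numbers at the end of each row are stripped first.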



<210> 7
<211> 643
<212> cDNA
<213> Zea mays

<221> CDS

<222> (1)..(402)

<400> 7

CGG GAG AAA ATA GCT GCT TTG AAG GGG GGT GTT TAC TTA GCC GAT GGT 48
Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

GAT GAA ACT GTT CCA GTT CTT AGT GCG GGC TAC ATG TGT GCG AAA GGA 96
Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

TGG CGT GGC AAA ACT CGT TTC AGC CCT GCC GGC AGC AAG ACT TAC GTG 144
Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

AGA GAA TAC AGC CAT TCG CCA CCC TCT ACT CTC CTG GAA GGC AGG GGC 192
Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

ACC CAG AGC GGT GCA CAT GTT GAT ATA ATG GGG AAC TTT GCT CTA ATT 240
Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

GAG GAC GTC ATC AGA ATA GCT GCT GGG GCA ACC GGT GAG GAA ATT GGT 288
Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

GGC GAT CAG GTT TAT TCA GAT ATA TTC AAG TGG TCA GAG AAA ATC AAA 336
Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

TTG AAA TTG TAA CCTATGGGAA GTTAAAGAAG TGCCGACCCG TTTATTGCGT TCC 391
Leu Lys Leu
115

AAAGTGTCCT GCCTGAGTGC AACTCTGGAT TTTGCTTAAA TATTGTAATT TTTCACGC 449
TTCATTCGTC CCTTTGTCAA ATTTACATTT GACAGGACGC CAATGCGATA CGATGTTG 507
TACCGCTATT TTCAGCATTG TATATTAAAC TGTACAGGTG TAAGTTGCAT TTGCCAGC 565
TGAAATTGTG TAGTCGTTTT CTTTACGATT TAATANCAAG TGGCGGAGCA GTGCCCCA 623
AGCNAAAAAA AAAAAAAAAA 643
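
In a CDS entry such as SEQ ID NO: 7, every codon row is printed above the amino acids it encodes, so the two rows can be cross-checked by translating the codons. The Python sketch below illustrates this for the first row, assuming the standard genetic code applies. The names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative and are not part of the listing.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter codes as used in the listing; "***" marks a stop.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "***",
}


def translate(codon_row: str) -> str:
    """Translate a whitespace-separated row of codons into three-letter codes."""
    codons = codon_row.upper().split()
    return " ".join(THREE_LETTER[CODON_TABLE[codon]] for codon in codons)


# First codon row of SEQ ID NO: 7 and the amino-acid row printed beneath it.
row = "CGG GAG AAA ATA GCT GCT TTG AAG GGG GGT GTT TAC TTA GCC GAT GGT"
expected = "Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly"

if __name__ == "__main__":
    print(translate(row) == expected)
```

A row whose translation disagrees with the printed amino acids points to a transcription error in either the nucleotide row or the amino-acid row.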
<210> 8

<211> 115
<212> PRT
<213> Zea mays

<400> 8

Arg Glu Lys Ile Ala Ala Leu Lys Gly Gly Val Tyr Leu Ala Asp Gly
1 5 10 15

Asp Glu Thr Val Pro Val Leu Ser Ala Gly Tyr Met Cys Ala Lys Gly
20 25 30

Trp Arg Gly Lys Thr Arg Phe Ser Pro Ala Gly Ser Lys Thr Tyr Val
35 40 45

Arg Glu Tyr Ser His Ser Pro Pro Ser Thr Leu Leu Glu Gly Arg Gly
50 55 60

Thr Gln Ser Gly Ala His Val Asp Ile Met Gly Asn Phe Ala Leu Ile
65 70 75 80

Glu Asp Val Ile Arg Ile Ala Ala Gly Ala Thr Gly Glu Glu Ile Gly
85 90 95

Gly Asp Gln Val Tyr Ser Asp Ile Phe Lys Trp Ser Glu Lys Ile Lys
100 105 110

Leu Lys Leu
115
<210> 9

<211> 616
<212> cDNA

<213> Neurospora crassa

<400> 9

ggtggcgaag acgaaggcgg aagttggagg ctaacgagaa cagcctcaag 50
agatggatct accccccaga gacacgacta ccnttgcacc tacatctatc 100
gtntacngtt tntatgggta ggaagccgac ggagcgagcc gaccgacacg 150
tggcgcccga tcccgggacg acaacgcatc ttcagacgac gcgaaggcga 200
actttgactn aggggcacat tgaccacggt gtgattttgg aaggggtgga 250
tggcacagtg aaccttatga gtttggggta cctgtgcaat ggtcgagatg 300
aaatgaagag atacaaccct gcgggctcaa aaataaccgt atacggcgga 350
ccgcatgaac cagaacggtt caatccgaga ggagggccga attcttaaag 400
tcacgtggat attctaggaa ggcagaatct aaacgagtac tagtaatatt 450
tggcgggagg tcgaggcgat acaattgagg attttattac taaatacggc 500
cttaaatatg tagaaaaggt tgaaatttat gaagagtaat ttttttctaa 550
acataggtta ctcaatagta tgactaatta aaaaaaaatt 600
aaaaaaaaaa aaaaaa 616
<210> 10
<211> 1562
<212> genomic DNA
<213> Arabidopsis thaliana

<400> 10

ATGAAAAAAA TATCTTCACA TTATTCGGTA GTCATAGCGA TACTCGTTGT 50
GGTGACGATG ACCTCGATGT GTCAAGCTGT GGGTAGCAAC GTGTACCCTT 100
TGATTCTGGT TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGGAC 150
AGAGAATACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TATATCCGAT 200
TCATAAGAAG AGTGGTGGAT GGTTTAGGCT ATGGTTCGAT GCAGCAGTGT 250
TATTGTCTCC CTTCACCAGG TGCTTCAGCG ATCGAATGAT GTTGTACTAT 300
GACCCTGATT TGGATGATTA CCAAAATGCT CCTGGTGTCC AAACCCGGGT 350
TCCTCATTTC GGTTCGACCA AATCACTTCT ATACCTCGAC CCTCGTCTCC 400
GGTTAGTACT TTCCAAGATA TATCATTTTG GGACATTTGC ATAATGAACA 450
AAATAGACAT AAATTTGGGG GATTATTGTT ATATCAATAT CCATTTATAT 500
GCTAGTCGGT AATGTGAGTG TTATGTTAGT ATAGTTAATG TGAGTGTTAT 550
GTGATTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 600
CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 650
GGTTTGATTT GCAGAGATGC CACATCTTAC ATGGAACATT TGGTGAAAGC 700
TCTAGAGAAA AAATGCGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 750
CATATGATTT CAGGTACGGC CTGGCTGCTT CGGGCCACCC GTCCCGTGTA 800
GCCTCACAGT TCCTACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAG 850
CGAGAACGAA GGAAAGCCAG TGATACTCCT CTCCCATAGC CTAGGAGGAC 900
TTTTCGTCCT CCATTTCCTC AACCGTACCA CCCCTTCATG GCGCCGCAAG 950
TACATCAAAC ACTTTGTTGC ACTCGCTGCG CCATGGGGTG GGACGATCTC 1000
TCAGATGAAG ACATTTGCTT CTGGCAACAC ACTCGGTGTC CCTTTAGTTA 1050
ACCCTTTGCT GGTCAGACGG CATCAGAGGA CCTCCGAGAG TAACCAATGG 1100
CTACTTCCAT CTACCAAAGT GTTTCACGAC AGAACTAAAC CGCTTGTCGT 1150
AACTCCCCAG GTTAACTACA CAGCTTACGA GATGGATCGG TTTTTTGCAG 1200
ACATTGGATT CTCACAAGGA GTTGTGCCTT ACAAGACAAG AGTGTTGCCT 1250
TTAACAGAGG AGCTGATGAC TCCGGGAGTG CCAGTCACTT GCATATATGG 1300
GAGAGGAGTT GATACACCGG AGGTTTTGAT GTATGGAAAA GGAGGATTCG 1350
ATAAGCAACC AGAGATTAAG TATGGAGATG GAGATGGGAC GGTTAATTTG 1400
GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 1450
TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCGCA CTTAAAGAGA 1500
TTATGAAGCA GATTTCAATT ATTAATTATG AATTAGCCAA TGTTAATGCC 1550
GTCAATGAAT GA 1562
<210> 11

<211> 3896 

<212> genomic DNA 

<213> Arabidopsis thaliana 

<400> 11 

ATGGGAGCGA ATTCGAAATC AGTAACGGCT TCCTTCACGG TCATCGCCGT 50 

TTTTTTCTTG ATTTGCGGTG GCCGAACTGC GGTGGAGGAT GAGACCGAGT 100 

TTCACGGCGA CTACTCGAAG CTATCGGGTA TAATCATTCC GGGATTTGCG 150 

TCGACGCAGC TACGAGCGTG GTCGATCCTT GACTGTCCAT ACACTCCGTT 200 
GGACTTCAAT CCGCTCGACC TCGTATGGCT AGACACCACT AAGGTCCGTG 250 

ATCTTCATTT CCTTCGCTCC TTATTCTGTC GGTCGAGTCA CTTGTTGATG 300

AATTCCAAGC GAAATATAGC AATGAAGCAT GTCTCGTCTC TCTTATTGAT 350 

TCGTTCATTA GTCAACAGTG ACGCTTCTGA ATCTGAGTTT AGAGTCATAT 400 

AAAACAGCTG ACTCGGCGAG TGTTTCCCAT CGCTTTTGGT TCGCTAAATG 450 

TAGCGCAATG AATGTGTAAT TAGTCTGCGC TTTTTATTCA ACTAGATCTG 500 

CAAGTTTTTC AGAGTGCTGA ATAGTAGTTA GAAAATGTTA GGTCATTTTA 550 

CTTGTGCATT GTGATTCTTT TGGTTGTTGC TTACTGATCG ACGTGATGGA 600 

TGGTTTACAG CTTCTTTCTG CTGTCAACTG CTGGTTTAAG TGTATGGTGC 650 

TAGATCCTTA TAATCAAACA GACCATCCCG AGTGTAAGTC ACGGCCTGAC 700 

AGTGGTCTTT CAGCCATCAC AGAATTGGAT CCAGGTTACA TAACAGGTAG 750 

TTTCGGATTT TTCTTTCTTT TGAGTTTTCT TCAATTTGAT ATCATCTTGT 800 

TGTGATATAA TATGGCTAAG TTCATTAATT TGGTCAATTT TCAGGTCCTC 850 

TTTCTACTGT CTGGAAAGAG TGGCTTAAGT GGTGTGTTGA GTTTGGTATA 900 

GAAGCAAATG CAATTGTCGC TGTTCCATAC GATTGGAGAT TGTCACCAAC 950 

CAAATTGGAA GAGCGTGACC TTTACTTTCA CAAGCTCAAG TTAGTCCTTA 1000 

TCAGGCTAAT GTCTTTTATC TTCTCTTTTT ATGTAAGATA AGCTAAGAGC 1050 

TCTGGTCGTC TTCCTTTTTG CAGGTTGACC TTTGAAACTG CTTTAAAACT 1100 

CCGTGGCGGC CCTTCTATAG TATTTGCCCA TTCAATGGGT AATAATGTCT 1150 

TCAGATACTT TCTGGAATGG CTGAGGCTAG AAATTGCACC AAAACATTAT 1200 

TTGAAGTGGC TTGATCAGCA TATCCATGCT TATTTCGCTG TTGGTACCGG 1250 

CCTACTATCC TTAAGTTACC ATTTTATTTT TTCTCTAATT GGGGGAGTTA 1300

TGTTGTGACT TACTGGATTG AGCTCGATAC CTGATTTGTT GTTGATTTAG 1350 

GAGCTCCTCT TCTTGGTTCT GTTGAGGCAA TCAAATCTAC TCTCTCTGGT 1400 

GTAACGTTTG GCCTTCCTGT TTCTGAGGTG ACCTCTGACT TCTCTTTAGT 1450 

TTTAAGTAGT TGATATCAAC CAGGTCTTAT AACTCACTGG ATTTTCCTTT 1500 

TGAAAGTATT ACTTTTGTTA ATTGAACTGC TGTACGCGAT ATGGTATCTG 1550 

TAGATCTTGA AGTGCTAGTT ATCAAAGAAC ATATTGTGGG TAGTATACCT 1600 

GTCAGCGGCC TTAGCTAATA CAACCAAACC ACATGTACAC TGATTTAGTT 1650

TTCAGATTAT TATGGTAGAC TTTAAGTTGA GAAGAAACTT TGACTGAAAT 1700 

CTTTTTATTT TAATAGGCTA TGATTTGTTT ATTGAAATCA TGTGACATAT 1750 

TGACATGCGC TTCTCATGTT TTTTGTTGGC AAGGCTTCAG GGAACTGCTC 1800 

GGTTGTTGTC CAATTCTTTT GCGTCGTCAT TGTGGCTTAT GCCATTTTCA 1850 

AAGAATTGCA AGGGTGATAA CACATTCTGG ACGCATTTTT CTGGGGGTGC 1900 

TGCAAAGAAA GATAAGCGCG TATACCACTG TGATGAAGAG GAATATCAAT 1950 

CAAAATATTC TGGCTGGCCG ACAAATATTA TTAACATTGA AATTCCTTCC 2000 

ACTAGCGGTT AGACTCTGTA TATGCAACTG TAACACTAAC AAAAGTTTCA 2050 

CCAAGAATGT TCACTCTCAT ATTTCGTTCC TTTGATGTGT ATCCATCAGT 2100 

TACAGAAACA GCTCTAGTCA ACATGACCAG CATGGAATGT GGCCTTCCCA 2150 

CCCTTTTGTC TTTCACAGCC CGTGAACTAG CAGATGGGAC TCTTTTCAAA 2200 

GCAATAGAAG ACTATGACCC AGATAGCAAG AGGATGTTAC ACCAGTTAAA 2250 

GAAGTACGTA CCTTTCTTTG TGATAAGAAA TATTGCTCAT CGATCATCAC 2300 

TTGCTGGCTT CTTGTACGTC AAATTGTTTT GTTTAAATCT CTATATCAAT 2350 

TGTTCATATG CTTTGTCTTT CTTACTATAA GAAACAAGTA TAATCAGAAA 2400 

CCTTATTATT GATTATCAGT TCTCTCCTTA TATTATGGAA TGTCTTTTTC 2450

GTTTACAGTT ATGAATGCAA AAGGGGGTAT TTTAGTTGAT TGATTCTCTC 2500 

ATTCTCTAGT TTGTTTTGAC TAATAGCGTC AATTTTGTTT TTCTAGCAAA 2550 

TCTTTGTGAA TTATATATAA CATGCTAACT ATACTTTTCA GGTTGTATCA 2600 

TGATGACCCT GTTTTTAATC CTCTGACTCC TTGGGAGAGA CCACCTATAA 2650 

AAAATGTATT TTGCATATAT GGTGCTCATC TAAAGACAGA GGTATGATGC 2700 

ATTCTCAATA TCACATTATG CGTTGACTTT GTTATTATAT TCCCCATTTG 2750

GTTTGCAATA TCTTTTTGAA TTATGATTTA TCTTCTCCCT TGCATCTTAT 2800 

GCTATTAAGC GTTAAAGGTA CTAAATGTAT GAAGCTGTCT GTCATAGGTT 2850
GGTTATTACT TTGCCCCAAG TGGCAAACCT TATCCTGATA ATTGGATCAT 2900
CACGGATATC ATTTATGAAA CTGAAGGTTC CCTCGTGTCA AGGTAATTTT 2950
CCGCAATGGC AGAAGTAAAA CAGGAAGGCA AAGTCTTCTG TATCAGTCTA 3000
GTGGCATGTT ATCTCAGTTG CATAAGCAAA TTATTAAACA ACTAAAATTT 3050
AAGTACTTTT TTATCATTCC TTTTGAGCTT AGTGGATGAT CAGTGGCTTA 3100
AAGTGGGAAG AGGTGTTGCA TGAAACATGA CACTTGTATC AAAGATAACT 3150
AGCAAAACAA AACTAACCCA TTTCTGAATT TCATATTATT AGGAGTAGTC 3200
GTGCTTTTAA AAAATTTGTT TTAAGAAACC GAAAAACTAG TTCATATCTT 3250
GATTGTGCAA TATCTGCAGG TCTGGAACTG TGGTTGATGG GAACGCTGGA 3300
CCTATAACTG GGGATGAGAC GGTAAGCTCA GAAGTTGGTT TTGAAATTAT 3350
CTTCTTGCAA ACTACTGAAG ACTAAGATAA TACTTGCTTC TGGAACACTG 3400
CTTGCTATGT TCTCTAGTAC ACTGCAATAT TGACTCTCCG CTACTTTTAT 3450
TGATTATGAA ATTGATCTCT TATAGGTACC CTATCATTCA CTCTCTTGGT 3500
GCAAGAATTG GCTCGGACCT AAAGTTAACA TAACAATGGC TCCCCAGGTA 3550
CTCTTTTTTA GTTCCTCACC TTATATAGAT CAAACTTTAA GTGTACTTTT 3600
CTGGTTATGT GTTGATTTAC CTCCAATTTG TTCTTTCTAA AAATCATATA 3650
TCTCTGTACT CCTCAAGAAC TTGTATTAAT CTAAACGAGA TTCTCATTGG 3700
GAAAATAAAA CAACAGCCAG AACACGATGG AAGCGACGTA CATGTGGAAC 3750
TAAATGTTGA TCATGAGCAT GGGTCAGACA TCATAGCTAA CATGACAAAA 3800
GCACCAAGGG TTAAGTACAT AACCTTTTAT GAAGACTCTG AGAGCATTCC 3850
GGGGAAGAGA ACCGCAGTCT GGGAGCTTGA TAAAAGTGGG TATTAA 3896
<210> 12

<211> 709
<212> cDNA

<213> Lycopersicon esculentum

<400> 12

CTGGGGCCAA AAGTGAACAT AACAAGGACA rrAPAGTC AG AGCATGATGT 50
TCAGATGTAC AAGTGCATCT AAATATAGAG ruTCk AfATG GTGAAGATAT 100
CATTCCCAAT ATGACAAAGT TACCTACAAT GAAGTACATA ACCTATTATG 150
AGGATTCTGA AAGTTTTCCA GGGACAAGAA CAGCAGTTTG GGAGCTTGAT 200
AAAGCAAATC ACAGGAACAT TGTCAGATCT CCAGCTTTGA TGCGGGAGCT 250
GTGGCTTGAG ATGTGGCATG ATATTCATGC TGATAAAAAG TCCAAGTTTG 300
TTACAAAAGG TGGTGTCTGA TCCTCACTAT TTTCTTCTAT AAATGTTTGA 350
GTTTGTATTG ACATTGTAAG TATTGCAACA AAAAGCAAAG CGTGGGCCTC 400
TGAGGGATGA GGACTGCTAT TGGGATTACG GGAAAGCTCG ATGTGCATGG 450
GCTGAACATT GTGAATACAG GTTAGAATAT TCAAATTATA TTTTGCAAAA 500
TATTCTCTTT TTGTGTATTT AGGCCACCTT TCCCCGGTCA CAACGATGCA 550
GATATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 600
TCTACATCTC ACATCCTGTC ACACTATGTG TGATATTTAA GAAACTTTGT 650
TTGGCGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AGCGAAATGA 700
TTCAGAGAG 709
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<210> 13 
<211> 623 
<212> PRT 

<213> Schizosaccharomyces pombe 
<400> 13 

MASSKKSKTHKKKKEVKSPIDLPNSKKPTRALSEQPSASETQSVSKKSRKSKFGKSLNFII/3AILGICGA70 
FFF AVGDDNAVFDP ATLDKFGNMLGS SDLFDD IKGYLS YNVFKDAPFTTDKPSQS PSGNEVQVGLDMYNE1 4 0 
GYRSDHPVIMVPGVISSGLESWSFNNCSIPYFRKRLWGSWSMLKAMFLDKQCVttEHLMLDKKTGLDPKGI210 

KLRAAQGFEAADFFITGYWIWSKVIEN^ 

IVHKKKVVLlSHSMGSQVTYYFFKWtfEAEGYGNGGPTWVOTHIEAFIN 

TGIVTTLNILEKFFSRSEPJVMMVRTMGGVSSMLPKGGDVAPDDLNQTOFSNGAIIRYREDIDKDHDEFDI420 
DDA^FLKNVTDDDFKVMLAKNY SHGLAVTTEKEVLKNNEMPSKWINPLETSLPYAPDMKIYCVHGVGKPT4 9 0 
ERGYYYTNNPEGQPVIDSSVNDGTKVENGIVMDDGDGTLPII^^ 

HEPAAFDLRGGPRSAEHVDILGHSELNEIILKVSSGHGDSVPNRYISDIQEI1NEINLDKPRN 623 



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<210> 14 ' ' ~ * — ----- - - 

<211> 432 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 14 

MKKISSKYSWIAILVWTMTSMCQ^ 

SGGWFRLWFD AAVLL S PFTRC F SDRMML Y YD PD LDDYQNAPGVQTRVPHFG STKSLLY LDPRLRDATSYM1 4 0 
EHLVKALEKKCGYVNDQTILGAPYDFRYGLAASGHP 1 0 

GGLFVLHFLNRTTPSWRRKYIKHFVALAAPW 80 
LPSTKVFHDRTKPLVVTPQVNYTAYEMDRFFADIGFSQGVVPYKTRVLPLTEEIM 

TPEVIJ^GKGGFDKQPEIKYGDGDGT^ 

NYELANVNAVNE 432 



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<2io> is" ' " ~ "~ ' ~ " ~ ~ " . 

<211> 552 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 15 

MGANSKSVTASF TVIAVFFLICGGRT AVE DETEFHGDYSKLSGI IIPGFASTQLRAWSILDCPYTPLDFN7 0 
PLDLVWLDTTKLLSAVNO^KCMVLDPYNQTDHPE 

VEFGIEANAIVAVPYDWRLSPTKLEERDLYFKKLKLTFETALKLRGGPSIVFAHSM 

EIAPKKYLKVfaDQHIHAYFAVGAPLLGSVEAIKST^ 8 0 

KGDNTFWTKFSGGAAKKDKRVYHCDEEEYQSKYSGWPTNI INIEI PSTSARELADGTLFKAIEDYDPDSK3 5 0 
RMLHQLKKWPFFVIRNIAHRSSLAGFLLYHDDPVFNPLTPWERPPIKNVFCIY 
KPYPDIWIITDIIYETEGSLVSRSGTV^ 

KQQPEHDGSDVHVELNVDHEHGSDI I ANMTKAPRVKYITFYEDSE S I PGKRTAVWELDKSGY 552 



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5 




<170> PatentIn Ver. 2.0 

<210> 10c 
<211> 661 
<212> PRT 

<213> Saccharomyces cerevisiae 

Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser 
1 5 10 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 

Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly 

Ile Ser Gly Ser Ala Lys Arg Asn Glu Arg Gly Lys Asp Phe Asp Arg 
50 55 60 

Lys Arg Asp Gly Asn Gly Arg Lys Arg Trp Arg Asp Ser Arg Arg Leu 
65 70 

Ile Phe Ile Leu Gly Ala Phe Leu Gly Val Leu Leu Pro Phe Ser Phe 
85 90 

Gly Ala Tyr His Val His Asn Ser Asp Ser Asp Leu Phe Asp Asn Phe 
100 

Val Asn Phe Asp Ser Leu Lys Val Tyr Leu Asp Asp Trp Lys Asp Val 
115 120 

Leu Pro Gln Gly Ile Ser Ser Phe Ile Asp Asp Ile Gln Ala Gly Asn 
130 135 140 

Tyr Ser Thr Ser Ser Leu Asp Asp Leu Ser Glu Asn Phe Ala Val Gly 
145 150 155 

Lys Gln Leu Leu Arg Asp Tyr Asn Ile Glu Ala Lys His Pro Val Val 
165 170 

Met Val Pro Gly Val Ile Ser Thr Gly Ile Glu Ser Trp Gly Val Ile 
180 185 

Gly Asp Asp Glu Cys Asp Ser Ser Ala His Phe Arg Lys Arg Leu Trp 
195 200 






WO 00/60095 PCT/EP00/02701 

23/53 



Sly ser ». Tvr Met Leu g • ~ ~ « ffl W * WS " 

, _ " Hie VeX Ket «u « « «- V f V ~ UP » - - 
225 

». •> L- U. Ua Ue «. «, - gj «« - « * Si ^ 
10 «, ^ W ~ » » «• «g *• «» » - SS ^ Ue 



Gly Tyr Glu Pro Asn Lys Met Thr Ser Ala Ala Tyr Asp Trp Arg Leu 
275 

Ala Tyr Leu Asp Leu Glu Arg Arg Asp Arg Tyr Phe Thr Lys Leu Lys 
290 295 300 



2 0 «, «. H. «» - £ Hie - « s «» - vil as 

305 

Ile Gly His Ser Met Gly Ser Gln Ile Ile Phe Tyr Phe Met Lys Trp 



325 

340 



25 Vei « U. 5 iu C-XV » «~ «. »g - «V 01V X. «J » 



Asn Glu His Ile Asp Ser Phe Ile Asn Ala Ala Gly Thr Leu Leu Gly 

Ala Pro Lys Ala Val Pro Ala Leu Ile Ser Gly Glu Met Lys Asp Thr 

370 

35 U; «n U. -a - - »* «"* SS "° 

W lie C-iu tt, vel I.VS M .en Oin Thr «p «r «» J» ~ 
405 

.40 „, Val no Tro Glv Asp Met Lys Ser Ser 

' M-c Leu Pro Lys Gly Glu Glu Val u- -r? . 43Q 
420 

Ser Glu Asp Ala Leu Asn Asn Asn Thr Asp Thr Tyr Gly Asn Phe Ile 
435 

Arg Phe Glu Arg Asn Thr Ser Asp Ala Phe Asn Lys Asn Leu Thr Met 
450 455 460 

Lys Asp Ala Ile Asn Met Thr Leu Ser Ile Ser Pro Glu Trp Leu Gln 
480 

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 

Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 
500 510 

Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr 
515 520 525 
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Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 

Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro 
545 550 560 

Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 

Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 

Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp 

Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser 
610 615 

Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp 
625 630 

Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln 
645 650 

Met Pro Phe Pro Met 
660 



<210> 2Ct 
<211> 387 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 2 
Val Gly Ser Asn Val Tyr Pro Leu Ile Leu Val Pro Gly Asn Gly Gly 
1 5 

Asn Gln Leu Glu Val Arg Leu Asp Arg Glu Tyr Lys Pro Ser Ser Val 
20 25 

Trp Cys Ser Ser Trp Leu Tyr Pro Ile His Lys Lys Ser Gly Gly Trp 
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Met: Pro Phe Pro Met 
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<212> PET . , 

<213> Saccharomyces cerevisiae 
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Met Gly Thr Leu Phe Arg Arg Asn Val Gln Asn Gln Lys Ser Asp Ser 
1 5 10 

Asp Glu Asn Asn Lys Gly Gly Ser Val His Asn Lys Arg Glu Ser Arg 

20 25 
Asn His Ile His His Gln Gln Gly Leu Gly His Lys Arg Arg Arg Gly 
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50 55 
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aga acs gta cac gag cjc tac tea tie gg= <== c «£ « c S? Ra ^aa 1410 

Arg Arg Val His Glu Gln Tyr Ser Phe Gly Tyr Ser Lys Asn Glu Glu 
485 490 495 

eae cca aes aaa aac gag eta cac =a = aac cac tec ccg aac cca acg 1536 
Glu Leu Arg Lys Asn Glu Leu His His Lys His Trp Ser Asn Pro Met 

caa S ta cca ctt cc^ gaa get ccc cac atg aaa acc tat feet aca cac 1584 
Glu Val Pro Leu Pro Glu Ala Pro His Met Lys Ile Tyr Cys Ile Tyr 
515 520 525 

ec= ccc aac aac cca acfe gaa. agg cca cac gta tat aag gaa gag gat 1632 
Gly Val Asn Asn Pro Thr Glu Arg Ala Tyr Val Tyr Lys Glu Glu Asp 
530 535 540 

cac tec tec get ctg aat teg ace ate cac tac gaa age aa= =« cct 1680 
Asp Ser Ser Ala Leu Asn Leu Thr Ile Asp Tyr Glu Ser Lys Gln Pro 
545 550 555 560 

eta tec etc acc cac ggg cac gga acc gtt ccg etc gtg gcg cat tea 1728 
Val Phe Leu Thr Glu Gly Asp Gly Thr Val Pro Leu Val Ala His Ser 
565 570 575 

ate tgt cac aaa egg gec cac ggt get cca ccg cac aac cct gec gga 1776 
Met Cys His Lys Trp Ala Gln Gly Ala Ser Pro Tyr Asn Pro Ala Gly 
580 585 590 

act aac ctt act ate etc gaa atg aaa cac cag cca gat cga ctt gat 1824 
Ile Asn Val Thr Ile Val Glu Met Lys His Gln Pro Asp Arg Phe Asp 
595 600 605 

aca cgt =gc cca cca aaa age ccc gaa cac cca gac ate etc gec age 1872 
Ile Arg Gly Gly Ala Lys Ser Ala Glu His Val Asp Ile Leu Gly Ser 
610 615 620 

ccc cac ttc aac aat tac ate tag aaa att gca age cct aat ggc gat 1920 
Ala Glu Leu Asn Asp Tyr Ile Leu Lys Ile Ala Ser Gly Asn Gly Asp 
625 630 635 640 

etc etc cac eca cgc caa afeg cct aat etc sec cag tgg gtt act cag 1968 
Leu Val Glu Pro Arg Gln Leu Ser Asn Leu Ser Gln Trp Val Ser Gln 
645 650 655 
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ate ccc etc cca atg taa 
Met Pro Phe Pro Met 



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<2ios 2 ir 

<211> 661 

<212> PRT 

<213> Saccharomyces cerevisiae 



_ ^sp val Gin Ass Gin Lys Ser Acp'ser 
HAt Cly Tfcr Leu Pas Arg Ar* ast. v*j. ^ 

M, clu «, ,.r. L~ «y «y «« -1 kI. « ». « «• ~ « 

n. £ ^ «• «- «J ° ly *" L: " *3 ° lV 

IU s« =" s„ »!. Ly S « »' «» «» «* «* »" 
W. ^ x= 5 «y « «» '«■«*• «. "» W »» - ^ " S " 

xi, «. «, «. - *- «»» ^ - « ~ 

„y u. Hi, « s« «p « P «~ £ « ~ 

« « ». r.; ~ - w i». - « « s ~ w vsi 

" s „ -,. a. mp w «■ =1 » cly " = 

leu ?ro Glr. cly J.l« «•- -•- 

130 " *„ T,au S« Glu Asn »*• Ala vai Gly 

Tyr Sar Thr Ser s« Leu As? As? Lau Sft- e-_ ^ 

£ ci, « « "p m. xu ~ - «*. - - ~ 
vn ,~ «, "i .u « ~ «v - - °» « f s - - 

130 * S «r =e- III Hie Phe Ar S Lys MS 

Gly Xs ? Asp Olu cys Asp Ser Se. Aift ^ 

125 v - v-i Asu Lys Val cys Trp 

Glv Se= »h. Tyr M» W« Ttr V " ^0 

210 ■ " «-u^ G W X-eu ?wso Pre ?r= ?vsn 

Mu Lvs Hi* Vai Met teu MP »*► ^ ^ " " 240 

223 230 cl- -h- ^sc Tyr 2?^-e lie 
Pha l-u Arc Ala Aia Oln Glv Pea Glu Se *- 

243 > ' ttf « Va i =he Glr. A« X.-SU Gly V*l lie 

Ala. Gly Tyr Tr? lie Tr? Asr. bys Va, -ne _ ?o 

2GC .1- s.-» t v - ASt5 T— Are Leu 

Gly tyr Glu Pre Aar. Ly S Mec Thr Ser «l. — -r- *^ 

272 290 » tv- Pi» Thr Lys ieu Lys 

Al t ryr Leu A= ? Leu Glu Ars W5 *iP — ^ * 3CC 

23» 235 , . S iv Glu Lys Val C>-5 Leu 

Glu Cl= SI. Glu Leu Kt. Ei= Glr. ,eu S~ 8-i 



305 510 
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IU oly Ki. K« 01, u= cm 11= II. «« Ty« »>• »« «, 

V.1 ou »• =1" «' »». — "* ° ly " 5 

» «. KU IU X. ? Se: Ph. IU « XU XU CIV - - "V 
XU «. Lyl XU v.! P » xu £ n. 3« «v «« »« «" "> * 
a. o" uu Asr. a, u. ™ «« V «r «| «• -v. «- J- 

x" n. Xr S v.! £ H.C Uu OU * W "V «» »° n: "* 
«. ,=» *r= lvb c" =u ci u v,l lie Z «r ^ H« s.z ur 

i! "> It 



420 Th>- Tvr Gly Asn Phe lis 

£ftr Giu Asp L^u Asn Asr. fcsn Thr Aap Th. Tyr y 

Ala Phe Asr. 



425 * n a Asr. Lvs Asn Leu Thr Kftt 

xrc 9hs Glu Ara Asn Thr Saz Asp Ala Phe Asr. 

'* S0 * 1-1 e*- t>ra Glu TrP LS'JL. Gift 

bye Asp Ala He Ma MC Thr Leu Scr He -« 480 

453 n . ^ rlv Tv - Ser Lys Asn Glu Glu 

Arc Arc v»l Kla Glu Gin Tyr ser Phe Gly ty- Se- W 

i ac 430 

t n„ t-u Hi* Lys His Trp ser As* Pro Met 

Glu Lsu Arg Lys Asn Glu Leu KX- . — ^ ' 510 ■ 

-nfl H OS 

m . v,i L « =i" S£5 »■ M.t *. xu w* ey. a. w» 

al y V.1 M. J-- P*= •> «U xx. XU ^ V,! IF I*. «» •»» M 

230 " „•«,_ tu* Glu ser Lvs Gin Pro 

Asp ser ser Ala Leu Mr. Leu Tar He Asp JV* sS(j 

5,12 , ' tV * M riv -n~ V»l ?ro Leu Val Ala Kis Sfer 

) Val Yfce Leu Thr Glu Gly As? Gly -U- vax 

^ - tsl-» Giv Ala Ser Sro Tyr Asa Pro Ala Gly 
Mat cys Kis Lys Trp Ala C-lr. Gly rJ-a ^ sgo 

,U »- v.! t" IU V.1 OU M l" Hi. ««• - «J « - 

u. u. "v «y xia ^ « ~ w Ile "» ° ly 

510 !,! t... tv- II- Ala Ser Gly Asr. Gly As? 

Ala Glu Leu Asa As? Tyr lis ^ ^' ^1 S40 

225 630 .»„ c„I rir m Val Ser Gin . 

Leu Val Glu Pro Ar= Gin Leu Ser A*r. Sa. Gin Tr? 



543 

Ms* ?rc «?h- Pro M- - 
£50 
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<2ic> air 

<211> 2312 

<212> genomic DNA 

<213> Schizosaccharomyces pombe 

<400> 3 

ATGGCGTCTT CCAAGAACAG CAAAACTCA7 AAGAAAAASA AAGAACTCAA 50 

ATCTCCTATC GAC77ACCAA A7TCAAAGAA ACCAAC7CSC GCTTTGAGTG 100 

AGCA^CCTTC AGCGTCCGAA. ACACAATC7G TTT C AA A7AA ATCAAGAAAA 15 C 

Tn^ATTTG GAAAAAGAT7 G AATTTT ATA 7TGGGCGC7A TT7ZGGGAA7 200 

ATCCGGTG CT TTTTTTTTCG CTG7TGGAGA CGACAATGCT GTTTTCGACC 25 0 

1 CTGCTACGTT AGATAAATTT GGGAATATGC 7AGGC7CTTC AGACTTGTTT 2 00 

GATCACATTA AAGGATATTT ATCTTATAAT G7G77TAAGG ATGCACC7T7 35 0 

TACTACGGAC AAGCCTTCGC AG T CT C CT AG CG3AAA7GAA GTTCAA3TTG 4 00 

GTCTTGATAT GTACAATGAG GGATATCGAA G7GACCATCC TCTTATTATG 43 0 

GTTCCTGGTG- TTATCAGCTC AGGATTAGAA AGTTGGTCGT 7TAATAATTG SOC 

CTCGATTCCT TAC77TAGGA AACGTCTTTG GGGT AG CTGG TCTATGCTGA 5 SO 

AGGCAATG7T CC77GACAAG CAATGCTGGC TTGAACATTT AATGCTTGAT £00 

AAAAAA.ACCG GCTTGGATCC GAAGGG AAT7 AAGCTGCGAG CAGCTCAGGG 650 

GTTTGAAGCA GCTGATTTTT TTATCACGGG C7A7TGGA7T TGGAGTAAAG 7 00 

XAATTGAAAA CCTTGCTGCA A7TGGTTATG AGCCTAATAA CATGTTAAGT 750 

GCTTCTTACG ATTGGCGGTT ATCATATGCA AAT7TAGAGG AACGTGATAA 500 

. ATATTTTTCA AAGTTAAAAA TGTTCATTGA G7ACAGCAAC ATTGTACATA S50 

AGAAAAAGGT AGTCTTGATT TCTCACTCCA TGGGTTCACA' GGTTACGTAC 900 

X A TX rTTTTA AGTGGGTTGA AUCUVAGSGC TACGGAXS.TG GTGGACCGAC O50 

T7GGGTTAAT GATCATATTG AAGCATTTAT AAATGTGAGT CTCGATGGTT 1000 

G7TTGACTAC G7TTCTAACT T7TGAAXAGA TATCGGGA7C TTTGATTGGA 105 0 

CCACCCAAAA CAGTGGCAGC GCTTXTATCG GG7GAAA7GA AAGATACAGG 1100 

7A77GTAA77 ACATTAAACA 7GTTAATAT7 TAATTTTTGC 7AACCGTTT7 11S0 

AAGCTCAArZ GAATCA37T7 TCGGTCTATG GG7AAGCAAT AAX77GT7GA 1200 

GAmxrrrAC taatttactg tttagtzxsg aaa^attttt ttccccttct :.2^n 

GAGGTATA7T CAAAAATACA AATGTGCTCT ACT77T7C7A AC7T7TAA7A 1200 

GAGAGCCATG ATGGTTCGCA CTATGGGAGG AGTTAGTTCT ATGCTTCC7A 1350 

AAGGAGGCGA 7G7TG7ATGG GGAAATGCCA ' G77GGG7AAG AAATA7G7QC 14.00 
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TGTTAATTTT TIATT^-VTAT TTAGGCTCCA CWGWCTO ATCAAACAAA 
rrmCCiVNT GGTGCAATTA TTCGATATAG AGAAGACA7T CATAAGGACC 
;,CGATGAATT TGACAT..GAT GATGCATTAC AATTTTTA>A AAATG,.ACA 
GA7GACGATT TTAA^CAT GCMX*^ AATTA^CCC ACGGTCTTGC 

nonctwx t,*aaaataa cga^gccc tctaaatgga. 

"TMMCCGCI AGAAGTAAGA AC ATT AAAGT TACTAAATTA TACTAACCCA 
AATAGACTAG TCTTCCTTA7 GCTCCTGATA TGAAAATTTA T7GCCTTCAC 

ggggtcggaa aaccaactga gagaggttat t^ttatacta at^tcgtga 
Lgcaacct ctcattgatt cctcggttaa tgatggaac* aaagttgaaa 
^acaga atttatgttt caaacattct axtaactct, txatxaggg, 

ATTGTTATGG ATGATCCTGA TGGAACTTTA CC AAT ATTAG CCCTTGGTTT 

ggtgtgcaat aaagtttggc aaaca^g gtttaa«c* cctaatacaa 

\*fl~ tgaac ctgctgcgtt tgatct'gaga 

G7ATC AC AAA TTAT6AAATC AAGw~*uAA^ 

„^. r , r . xca-gtcgay macttcsac attcagacct 

GGAGGACCTC GCTCGGC^Grv ACA^^ua 

SlAATTT ctattactaa ctcttgaaat 

AAATGTATGT TCATYTTACC 

t •* ttca-caggc caxggtgact cggtaccaaa 

AAGGAAATXA TTTTAAAAG* uw rt .^uus. 

— x^-GGACA TAAGTrTTGT ACATTGCAAT 
CCCTTATATA TCAGATATCG AGTAv.GwAO> 

^- r ,^TUT^ JJTGAGAXAAA TCTCGATAAA 
TAACTAACTA ^CCGAACAGG GAAATAA^Aj ^ 

CCTAGAAATt AA 



1450 
1500 
1550 
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iSSO 

1700 

1750 

1800 
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19O0 

1950 

2000 

2050 

2100 
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<2ll> 2S8S 



< ili.? ■* « ■* 



<400> 4 



^GCGCGTTA TTCATCGGAA AAACCCGACG ^ 

• rrmflOJLTtC GCAAAAGAAA CCACACGAAT 
WCTGAAGAG GTGGTGCACG ATGAG.ATT. „ fi . TCGTGC 
^»iO AAATCGAACG GAGGAGGGAA OW-TCM**- 
CTTCCAAATC CCACCATAAG AAATC C ^ GGTGC3 TT 

1- OTTGGGTGT GTGTGT«TA~ C-x 

ATCGATTCTT GTTGTTGuTT CATTG 

_ . .^-rrccn CAGTAiG***- 

^CXCTTG GTTTAGAACG CAATGCCTGC 

C— . CACGGGTCCT ^= 

^GAAGGTG TTAAGGCOAA ^ 

,-^G-TTT GGGAAGGCAA ACAATGCGCT GAiGuTTT- 
CACCGGTGGG CTCGAGwTTT _ r ^ GT cT ? CAAAAGGTGA 

"TAGAAAACG TTTGTGGGGT GGAACTTTTG GTGAAGTCTr. 

' m CCTTTATATT GGGATTTGGA TTGGATCTGA 
r-TCAACAAT TCTCACTCTT CCTTT»taj. 

WW "»« BO * 

^-.-v- T-TTATCTT TACTTTTXTT TT~i 
TGTTTGTCTw r.TTACT * «r*"TAAGCG TTCCAAGGAC 

.^^„TTT CTGTATGTGT AAG%»*AA" w* 

GTAATTGGCT TGGACTATTT «G^ 

«CTT GGAATTTGAA QTCATCS „ 
ATGCCCTAOA GTCCGTTCAT AACAAGTTAC T.C.TTTA 
GTAGATTTAG CTTTGTGTAG CGTATAATGA AGTAGTG TTT ~ 

t , GXTCTAA CTAGATCTGT GGAAAGTGTG nCAG— 

TGGGAATAGA G«AGTXw*AA . {yfftATTAAAG 

TTATTCAACT ATGTAT>v--" 

GATAGAGGAC TGTTGCTTTA ttt^TTTC 
^.^-r CAGCTCAATG TGCTTTTCTC ^TITCnTC 
CTAGTTCCTT TTTwAi-TTI -j^-rTCSTC 

~„ CGAGTTTATT CACATGTCTT GA.TTTC . 

TOSWXTC*- — - . _ CCTCCCGXC C GTGCTATGGA 

CATCCTCC-TT CTGTTATCCA 0*X>**~ 

— TGT3GSTTG CATCTTTG. 

TATATTAAAA AAARASTGTT 

, rftS r GTTCATGTTT TMC^*— * 

, CAGTGGTATA G,w/>-^ J 

xxrcxt.vno i»— «- ^ 

««w 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

800 



850 

900 

950 

1000 

1050 

1100 

1150 

1200 

1250 

1300 

1350 
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TOroCTCC. XCCCTCCXC. CXftCXXXGCX CCXGCCXP.CX XXGXCXCCCC 
^TGCXCAXX CCXAXCCXXC CACATATXGG -AXAXGAAGAG ^ 
ACA^GGCTGC AXATGACTGG CGGCXXXCCX XXCAGAAC A C XGAGGXXCXX 
XTCTCAXCCX XCXXXCXATt AXXCXGXTCC AXGTXACGTT TCXXXCXXCA 
XTA.CXTAAGG CTXAAATATG XTTCA.XGXXG AAXT AAX AGG ^ 
GACTCXXAGC CGXAXGAAAA ST*™ GTTGAXGGXX XCXACCAACG 
AGCAGTXAXA CXXCCGCAXX CCAXGGGGGX CXXGXAXXXX 
CX.CXXXXX XGAAGXGGGT TGAGCCACCA GCXCCXCXGG GXGG CGGGGG 

_„ S8 * r*TA.T * XX AA GCCGGTGAXG AACATTGGXG 
)tcggccagax TGGXGXGCAA AGXAT^TAA g 

WBMTICI XC^CC, ««««™ — CTCTGCTGAA 
=C»«0«= TO C« TO C WW «««*« AXAGTTTTOA 
CXX M =X=XO «««»» 

T ™=. xx™* xx==ct==~ "<™ TO ™ 

.^CNm XCTAACTSAA JvCXCXCTXGh CTT^M 
0OT *XX=XX M ==,TT=« ««WW «XXC»=« 

w ™ o^cxcxc* «B—e .««»»»» 

OTMUK XXCOTGCXX CC^C^, ««*?«. 

. CTrofATT XIC XX=X*X* x=cxxcx«= gt=tx«»k tcttctt^ 
, team gxcmxigsx tccxxsaxax 

OAM»ETC6 TTOSMTMT T«wTCTA-X» l-i 
) ACAACXXXAC XXXCXXCGAA AAXTCCAGAC CGAXXGCCCC AGGAXXCXX* 
■ CCACCC^XA X^XXXAGACX XC.GACCXXC — TCAGAATGAC 
^ GACXCAACAA XGTCXAXGXX accgaaggga -cacacga 
taxggggcgg ccxxgaxxgg xcaccggagaaaggccacac cxgxxgxggg 

« » C**A. AA-X-GXGGX CAAGCAGCXG AAAACGGAGT 
J^AXAGCAAA AGAACAAC-A AA-J.--J-- 1 - 

? X rTAXGGAAG GATGAXAXCX TXXGGGAAAG 

7TCCAACAAA AGXCCX«j.*-A JUiAJ««r— 

_ tx^ATAXTGA XTTXCGAGXA 

AAG-AGCAGA CGCXGCGCCA TCWS~T„ _ 

,CGAGAXAXA AAXC ATAAT A AACCXTGTAC AXXXXGXGA, XSXAXCAXSA 
AX^CXGXAC ATXXTAXCXG GXGAAGGGXG. CXGXCAAAGG XCAGAGXAXC 

_ r _ T — G A CGTGTSGACA G AGX ACCAXG ACAXCCGAXT 
CCAftAXCACA CCTGj.uw-.uA wu-vj 

. ""DACACT gcxgstga&g 

XGCXGGGAXC AAAGCTATCG CTGAGTATAA G^wXACA 
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c ™cx ^ cncc^ ag OT c ccgtggxgcc 
oc«™r en****** £CAAGTATCA 

, „, ^ MCCCTTWA GA.CAAAGTAA. GTGATTTCTr 

AGA.TCCCAAA. TACTw«-*CAA AJ.'-'- ^ 

._ rr _ ec TCCTGA.TGCA T7ATCAGTCT TTTTCTTTTC . 
GATTCCAACT STATCCTTCC TCCTCAiu 

' OCTCCT^O OH»«T> C=«AC»*= Ct=T^ 0 , 

0 ™» «-C=CTX 0 ^ *—««* """"" 

on««e xrac— ^cccn t==^=-c 

— «~ 

••■ TTCCXXCCCC ««" "T*^- 

=CTSTCTO~ «»»- K5CaOXTO ^ 

ea-n«w — ~« — «« 

, TO ,«==X TCCCO^CK ^ """"L 

ccecaae™ ccr^M —««» .»*— — 

»^-t=c nroKKau oatmcMoa coorrocco. 

CUXUOITS3 GiM-CTTTOC *** 

-I*- S-OT-nUk T«="=**<* C=«CTCC« tOMCKIW 
^^-^TT GACCTGAW3C TGTGA 



2900 

2950 

3000 

3050 

3100 
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<210> 9 fy 
<211> 516 
<212> cDNA 

<213> Neurospora crassa 
<4Q0>. a 



GGT5GCGAAG ACCAtfGGCGG AACTTGGAGG CTAACGAGAR TGACtfCTCGG 50

AGATGGATCT ACCCTCTAGA GACACGACTA CCNTTCCACC CAGCCTCAAG 100

GTMTACNGTT TCTATGGGTA CGAAGCCGAC GGAGCGAGCC TACATCTATC 150

TGGCGCCCGA TCCCGCGACG ACAACGCATC TTTAGATGAC GATCGATACG 200

ACTTTGACTK AGGGGCACAT TGACCACGGT GTGATTTTGG GCGAAGGCGA 250

TGGCACAGTG AACCTTATGA GTTTGGGGTA CCTGTGCAAT AAGGGGTGGA 300

A&ATSAACAG ATACAATCCT GCGGGCTCAA AAATAACCGT GGTCGAGATG 350

CCGCATGAAC CAGAACCGTT CAATCCGAGA GGAGGGCCCA ATACGGCCGA 400

CTTAAATATG TAGAAAAGGT XGAAATTTAT GAAGAGTAAT TAAATACGGC 450

ACATAGGTTA CTCAATAGTA TGACTAATTA ^AAAAJSAXT TTTTTTCTAA 500

AAWNAAAAAA AAAAAA 516
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TTAACAGAGG AGCTGATCAC TCCGGGAGTC CCAGTCACTT GCATATATGG 
GAGAOGAGTT GATACACCGG AGGTTTTGAT GTATGGAAAA GGAGGATTCG 
ATAAGCAACC AGAGATTAAG TATGGAGATG GAGATGGGAC GGTTAATTTG 
GCGAGCTTAG CAGCTTTGAA AGTCGATAGC TTGAACACCG TAGAGATTGA 
TGGAGTTTCG CATACATCTA TACTTAAAGA CGAGATCGCA CTTAAAGAGA 
XTATGAAGCA GATTTCAATT ATTAATTATG AATTAGCCAA TGTTAATGCC 
GTCAATGAAT GA 



1300 
1350 
1400 
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1500 
1550 
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<ilo> sjtr 

<212> genomic DNA 

<213> Arabidopsis thaliana 

<400> 9 



.MBttlWM TATCTTCMA XXATXCGGTA GXCATAGCGA XACXCGXXGT. 
GGTGACGATG ACCTCGATGT GXCAAGCTGT GGGTAGCAAC GTGTACCCTT 
' TGATTCTGGX TCCAGGAAAC GGAGGTAACC AGCTAGAGGT ACGGCTGGAC 
j AGAG AAXACA AGCCAAGTAG TGTCTGGTGT AGCAGCTGGT TAXAXCCGAT 
TCATAAGAAG AGTGGTGGAT GGTTTAGGCT AXGGTTCGAT GCAGCAGTGT 
TATTGTCTCC CTTCACCAGG TGCXTCAGCG ATCGAATGAT GTTGTACTAT 
GACr.CTa.TT TGO*TG*TTA CCAAAATGCT CCTGGTGTCC AAACCCGGGV 
TCCTCATTTC GGTTCGACCA AAXCACTTCT AXACCTCGAC CCTCGTCTCC 
GGTTAGTACT TTCCAAGATA TATCATTTTG GGACATTTGC ATAA7GAACA 
AAATAGACAT AAATXTGGGG GATXAXTGXT ATAXCAAXAX CCAXTXAXAT 
GCTAGTCGGT AATGTGAGTG XXATGXXAGT A,T AGTT AAX G TGAGXGTTAT 
GXGAXTTTCC ATTTTAAATG AAGCTAGAAA GTTGTCGTTT AATAATGTTG 
. CTATGTCATG AGAATTATAA GGACACTATG TAAATGTAGC TTAATAATAA 
GGTTTGATTT GCAGAGATGC CACAICTTAC ATGGAACATT TGGTCAAACC 
" TCTAGAGAAA AAATGCGGGT ATGTTAACGA CCAAACCATC CTAGGAGCTC 
CAXA7GATTX CAGGTACGGC CTGGCTGCTT" CGGGCCACCC GTCCCGXGTA 
GCCTCACAGT TCCXACAAGA CCTCAAACAA TTGGTGGAAA AAACTAGCAG 
CGAGAACGAA GGAAAGCCAG XGAXACTCCX CTCCCATAGC CXAGGAGGAC 
TXXTCGXCCT CCAXXXCCTC AACCGTACCA CCCCXXCAXG C-CGCCGCAAG 
TACAT'CAAAC ACTTTGTTGC ACTCGCTGCG CCA1GGGGTG GGACGAXCTC 
TCAGATGAAG ACAXTXGCTX CTGGCAACAC ACTCGGXGXC CCXXXAGXTA 
ACCC7XTGCT GGXCAGACGG CAXCAGAGGA CCTCCGAGAG XAACCAAXGC- 
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GTTTGTATTG ACATTGTAAG 'TATTGCAACA AAAAGCAAAG CGTCGGCCTC 400 

TGAGGGATCA GGACTGCTAT TGGGATTACG GGAAAGCTCG ATGTGCATGG 450 

GCTGAACATT GTGAATACAG GTTAGAATAT TCAAATTATA TTTTGCAAAA 500 

TATTCTCTTT TTGTGTATTT AGGCCACCTT TCCCCCGTCA CAACGATGCA 550 

GATATGTATT CGGGGATGTT CACCTGGGAC AGAGTTGCAG ATTGAAGAGT 600 

TCTACATCTC ACATCCTGTC ACACTATGTG TGATATTTAA GAAACTTTGT 650 

TTGGCGGAAC AACAAGTTTG CACAAACATT TGAAGAAGAA AGCGAAATGA 700 
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